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APPARATUS AND METHOD FOR A RELOCATABLE FILE FORMAT 
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LIMITED COPYRIGHT WAIVER 
A portion of the disclosure of this patent document 
contains material to which the claim of copyright 
10 . protection is made. The copyright owner has no 
objection to the facsimile reproduction by any person of 
the patent docximent or the patent disclosure, as it 
appears in the U.S. Patent and Trademark Office file or 
records, but reserves all other rights whatsoever. 

15 

BACKGROUND 
1 . Field of the Invention 

The invention relates to the loading of software 
into memory for execution by a computer system, and more 
20 particularly, to techniques and file formats which 
facilitate the same. 



2 . Description of Related Art 

Computer programs are typically written originally 
in source code in a computer language such as C or 
Pascal, or in an assembly language. To prepare the 
program for execution on a computer system, one or more 
source code modules are passed through a compiler (or 
assembler) which is specific to the language used and 
the computer system on which it will be executed, and 
which generates an object code file as its output. A 
linker routine, which is either a separate program or is 
part of the compiler, combines them into a single output 
file, known as an "executable" object code file. One or 
more executables are then loaded together into memory by 
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a loader program, and control is transferred to a start 
address to initiate program execution. 

An executable object code file typically includes) 
among other things, a header section which contains 
information about the structure of the file; one or more 
code sections which contains binary instructions which 
are directly executable by the system's CPU; one or more 
data sections; and a loader section, the contents of 
which are described below, 

A data section nypically contains data which was 
initialized by the compiler in response to the source 
code, descriptors describing various procedure pointers, 
as well as several other types of pointers. The various 
pointers which are contained in the data section may 
15 include some which refer to the address in memory of 
other data objects or of specific computer instructions. 
For example, a pointer may refer to specific objects in 
a code section, such as the entry point of a procedure. 
Other pointers in the data section may contain the 
addresses of other objects in the same data, section. 
{As used herein, an address may be real or virtual, 
depending on the computer system used) . Further, in 
systems where programs may be compiled into two or more 
executable files and subsequently loaded together, a 
data section in one file may contain pointers to objects 
in a code or data section of another file. 

All of these references to absolute addresses must 
be "relocatable" since at the time of compilation and 
linking, the compiler/linker has no way of knowing what 
will be the ultimate addresses in memory at which the 
various referenced objects will be loaded. Thus 
references in an executable object code file to an 
address in a code section are often represented in a 
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form which is merely relative to the start of the code 
section, and^ references to an object in a data section 
are represented in a form which is merely relative to^ 
the starting address of the data section. The loader 
program is then able to perform a relocation of these 
references by, after a referenced section is loaded into 
memory and the start address of that section is known, 
merely adding that start address to all of the 
references to objects within that section. 

References to external symbols are typically 
represented in an executable object code file as indices 
into a symbol import/export table which is also 
contained in the file, each import entry in the import/ 
export table identifying both the name of one of the 
symbols and the external file which should contain that 
symbol. The indices are often numbered consecutively, 
and as will be seen below, the table also contains 
export entries . When the loader program encounters a 
reference to an external symbol, it loads the external 
file and determines the address of the referenced 
symbol. The loader program then relocates the reference 
by adding in the address of the referenced symbol. 

In order to determine the address of an import 
symbol, the loader searches for the symbol, usually by 
name, among the exports in the symbol import /export 
table of the external file. The entry which contains 
the desired export also designates the address of the 
symbol by designating the section number of the external 
file containing the symbol, and an offset into that 
section. 

The loader section of an executable object code file 
typically includes a relocation table containing entries 
which specify how each relocatable reference is to be 
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relocated upon loading into memory. For example, for a 
relocatable ^reference to an object which is within a 
code section, the relocation table contains a 
specification that the number to be added to the 
5 reference is the start address of the code section, 
rather than the start address of some other section. 
Similarly, for a relocatable reference to an object 
which is contained within a data section, the relocation 
table contains an entry specifying that the number to be 

10 added to the relocatable reference is the start address 
of the data section rather than of a code section. For 
a relocatable reference to an external symbol, the 
relocation table contains a corresponding specification 
of the index to the desired entry in the symbol import 

15 table. 

When the loader program begins operation, it 
retrieves the desired executable object code file from 
a mass storage device such as a disk. If the computer 
system permits multiple tasks to be resident 
20 simultaneously using a shared object code section, then 
a separate copy of the data section (s) is (are) made for 
each task which will use the loaded file. In one 
example of memory organization, the loader may first 
check whether the desired file is already present in 

2 5 memory for another task. If not, the loader loads the 

header, code, data and loader sections of the file into 
a portion of memory which is read-only to individual 
users. In either case, the loader then makes a copy of 
the data section (s) in read/write memory for the new 

3 0 task. 

If the loader has been invoked to load several files 
or modules into memory at the same time, then these 
files, too, are loaded into memory in the same manner as 
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the first file. All the references to external symbols 
are resolved*at this time, by inserting into each file's 
symbol import table the address into which each symbol' 



recursively. That is, when one module (e.g. an 
application program) references a symbol in a second 
module (e.g. a library) , the loader may load and perform 
all relocations on the second module before returning to 
resolve the symbol in the first module. Similarly/ if 
the second module references a symbol in a third module, 
the loader may load and perform all relocations in the 
third module before returning to the second, and so on. 

After the various sections of file have been loaded 
into memory, and imports have been resolved, the loader 
performs the relocation process. The relocation process 
is performed by traversing the relocation table in the 
loader section, and performing the specified relocation 
operation for each of the relocatable references 
contained within the current file. 

One popular format for executable object code files 
is known as XCOFF. XCOFF is described in the following 
articles published in IBM, "R6000 Inf oExplorer " (CD-ROM, 
1992) : 

"a. out File Format", 

"Optional Auxiliary Header for the a. out File", 

"Section Headers for the a. out File", 

"Raw Data Sections for the a. out File", 

"Special Data Sections for the a. out File", 

"Relocation Information for the a . out File", 

"xcof f .h" , 

"f ilehdr .h" , 

"reloch", 

"scnhdr .h" , 

"loader .h" ; 

all incorporated herein by reference. In XCOFF, the 
loader section has the following portions: 



has been loaded. 



Symbol imports may be resolved 
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Header Portion 

External Symbol Import /Export Table 
Relocation Table 
Import File Ids 
Loader String Table. 

The Header Portion of the Loader Section contains a 
version number and sufficient information to navigate 
the remainder of the Loader Section. 

The External Symbol Import/Export Table contains one 
24 -byte entry for each of the external references for 
the file. The entries are unsorted, with no grouping of 
imports separately from exports, and no grouping of 
imports from one external file separate from imports 
from another external file. Each entry in the External 
Symbol Import/Export Table has the following fields: 



TABLE I 



20 



25 



Field Name 
l_name 

l_value 

1 - scnum 

l_smtype 



l__smclas 
1 ifile 



Length 
Bytes 

8 



1 
4 



Description 

Full symbol name (if ^ 8 bytes) , 
or offset to full symbol name in 
the Loader String Table. 

Offset into 1-scnum where export 
symbol is located; unused if 
import . 

Section number containing the 
export symbol, or 0 if import or 
not defined. 

Symbol type, and flags 
indicating whether the symbol is 
an import, export, and/or entry 
point description . 

Symbol storage class. 

Import file id: ordinal of 
import file Ids in Loader Import 
File Ids portion. Specifies 
source for import symbols. 
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Length 

Field Name - Bytes Description 
l_parTn 4 Parameter type check field. 

The Relocation Table contains the relocations 
information for the XCOFF file. Each entry in the 
5 Relocation Table is 12 bytes long and contains the 
following fields: 

TABLE II 

Length 

Field Name (Bytes) Description 

10 l__vaddr 4 Offset within section number 

specified in l_rsecnm, of an 
information item to be 
relocated. 

l__symndx 4 External symbol import table 

index of object that is being 
referenced. 

l_rtype 2 Type of relocation. 

l_,rsecnm 2 Nvimber of the section 

containing the relocatable 
item governed by this table 
entry, 

The l_symndx field of a relocation table entry 
15 specifies whether the item to be relocated is a 
reference to an external symbol, or to an object in one 
of the code or data sections. Specifically, values of 
1 and 2 indicate that the reference is to an object in 
a .data or .bss section respectively (both of which are 
20 considered "data sections" as the term is used herein) , 
and a value of 0 indicates that the reference is to an 
object in a .text section (code) . Values of 3 or higher 
constitute indices into the external symbol import table 
for the file, and indicate that the relocatable 
25 reference in the information item is a reference to the 
corresponding external symbol. In this case, the 
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relocatable reference in the information item itself may 
contain 0, or an offset value to which the address of 
the external symbol will be added. Note that while, 
relocation table entries have the capacity to control 
5 relocations of information items contained in the code 
section, this capacity is rarely used on computer 
systems which support relative branching. For these 
systems, when a compiler generates a branch instruction 
for the code section, it cypically uses the relative 

10 branch format so as to obviate any need for a 
relocation. When the compiler generates an instruct ion 
which references a data object, it typically uses an 
indexed addressing mechanism for which only the offset 
from the base address of the data section need be 

15 included in the ultimately executed code. The software 
pre-loads the starting address of the desired data 
section into a register to use as the base address. 
Further, in the situation where code sections are 
sharable, relocations are avoided in the code section 

20 also because the relocation appropriate for one task may 
not be the same as the relocation appropriate for 
another task sharing the same code section. 

The l_rtype field indicates the type of relocation 
which is to be performed, and most commonly contains a 

25 value indicating that the reference is an absolute 
32 -bit reference to the virtual address of the object. 

l_rsecnm indicates the section number containing the 
information item to be relocated. As with the l_symndx 
field, certain predefined values are implicit references 

30 to the .text, .data and .bss sections, respectively. 

The Import File Ids portion of the XCOFF file 
contains variable length strings, each carrying the file 
Id for a respective one of the import files referenced 
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ordinally in the l_ifile field of an import entry in the 
External Symbol Import /Export Table. Similarly, the 
Loader String Table porion of the file contains variable 
length strings, each carrying the text name of a 
respective one of the import and export symbols 
referenced in the l_name field of an import or export 
entry in the External Symbol Import/Export Table. 

One problem with the XCOFF file format is that it is 
extremely inefficient in terms of space occupied in the 
mass storage device, in terms of memory usage at launch 
time, and in terms of the time required to launch an 
application. The space which an XCOFF file occupies in 
mass storage is in large part due to the fact that XCOFF 
requires 12 bytes of relocation information for each 4- 
byte word in the data section that requires relocation. 
Thus in an executable object code file containing 1.5 
megabytes, as much as 300k bytes might be occupied by 
the relocation table. The relocation table space 
overhead is also a large factor in the inefficient usage 
of memory at launch time. The inefficiency of launch 
time speed performance is due in part to the need to 
retrieve and interpret 12 bytes for every relocation to 
be performed. 

The inefficiency of the XCOFF file format is due 
also in part to the absence of any organization in the 
External Symbol Import/Export Table. This means that 
when the loader program needs to import a symbol 
exported by an XCOFF file, it must search through the 
entire External Symbol Import/Export Table until it is 
found. This can amount to hundreds or thousands of 
entries in some programs, which may need to be searched 
hundreds or thousands of times (once for each import 
from the file) . Moreover, each search can involve a 
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lengthy string comparison of the symbol name of an 
export in the Table, with the desired symbol import 
name . 

An XCOFF loader program may attempt to speed this 
5 process by reading the entire External Symbol Import/ 
Export Table of an imported XCOFF file into read/write 
memory before performing any comparisons, but this 
itself takes time and requires an extensive amount of 
memory. Additionally, the entire Loader String Table 

10 would also need to be loaded into memory, and in a 
virtual memory computer system there is a strong 
likelihood that the need to repeatedly access the 
External Symbol Import/Export Table entries and the 
Loader String Table in an alternating manner will cause 

15 repeated page faults and further degrade performance. 
Moreover, a typical importing file may need to import 
only 10% of the symbols exported by the imported file 
being read into memory in this manner. Thus the vast 
majority of the entries in the External Symbol 

20 Import/Export Table and the Loader String Table which 
are brought into memory in order to speed import 
resolution, may never be used. 

In addition to XCOFF, another conventional format 
for executable object code files is used in the GEM disk 

25 operating system for Atari ST computers. See K. Peel, 
"The Concise Atari ST 68000 Programmer's Reference 
Guide" (Glentcp Publishers: 1986), especially pp. 2-21 
through 2-24. The entire Peel guide is incorporated 
herein by reference, 

30 In the GEM format, the loader section of an 

executable object code file consists of a series of 
bytes , each of which specifies at most a single 
relocation. A loader routine maintains a pointer into 
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the program being loaded, and updates the pointer in 
dependence ^upon each byte in the loader section. 
Specifically, if a byte in the loader section contains 
any number between 2-255 inclusive, the loader routine 
5 advances the pointer by the specified number of bytes 
and adds the start program address to the 32 -bit 
relocatable reference then pointed to by the pointer. 
If the byte in the loader section contains the value 1, 
then the loader routine advances the pointer by 254 

10 bytes without performing a relocation. A zero byte in 
the loader section indicates the end of relocations . 

The GEM executable object code file format and 
loader routine are extremely primitive, lacking any 
capability for symbol imports and exports, for separate 

15 code and data sections, or for any kind of relocation 
other than the addition of the start program address to 
a 32 -bit relocatable reference. Additionally, like the 
XCOFF format, the GEM format still contains a relocation 
table entry (byte) for each relocation to be performed. 

20 

SUMMARY OF THE INVENTION 
The invention takes advantage of certain 
characteristics of executable object code files to 
drastically reduce the number of bytes of relocation 

25 information which are required per relocation. In 
particular, roughly described, relocation table entries 
in an executable object code file are interpreted as 
relocation instructions rather than individual 
specifications for a particular respective relocatable 

3 0 information item. An abstract machine is provided for 
interpreting the relocation instructions and performing 
various relocation operations and various control 
functions for the abstract machine, in response to the 
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relocation instructions. The abstract machine maintains 
certain variables containing information which is 
referenced and updated in response to certain types of 
the relocation instructions, thereby obviating the need 
5 to include such information as part of each relocation 
instruction . Certain of the relocation instruction 
types can also specify a particular relocation operation 
to be performed on a run of n consecutive relocatable 
information items, where n is specified as part of the 

10 relocation instruction . Other types of relocation 
instructions are also made available. 

In another aspect of the invention, an executable 
obj ect code file separates the import symbol table from 
the export symbol table, and in a combined string table, 

15 locates the import symbol name strings closer to the 
import symbol table and locates the export symbol name 
strings closer to the export symbol table. The export 
symbol table is also sorted according to a hashing 
algorithm. Additionally, pattern initialized data 

20 sections are stored as data expansion instructions to 
further reduce the size of the object file. 

BRIEF DESCRIPTION OF THE DRAWINGS 
The invention will be described with respect to 
25 particular embodiments thereof, and references will be 
made to the drawings , in which like elements are given 
like designations, and in which: 

Fig. 1 is a symbolic, simplified block diagram of a 
computer system incorporating the invention; 
30 Fig. 2 is a simplified flow chart illustrating the 

broad functions of a loader program using the present 
invention; 
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Fig. 3 is a symbolic diagram of information storage 
in the memory of Fig. 1; 

Fig. 4 is a flow chart detailing the RESOLVE IMPORTS- 
step of Fig. 2; 

5 Fig. 5 is a flow chart detailing the PERFORM 

RELOCATIONS step of Fig. 2; 

Fig. 6 illustrates the structure of a data section 
generated by a typical compiler; 

Figs. 7A-7G illustrate field definitions for 
10 relocation instructions recognized by the abstract 
machine; 

Fig. 8 is a flowchart detailing the FIND NEXT IMPORT 
SYMBOL NAME step in Fig. 4; 

Fig. 9 is a simplified flowchart illustrating the 
15 broad functions of a program for constructing a file 
according to the present invention; 

Fig. 10 is a flowchart detailing the CONVERT SYMBOLS 
step of Fig. 9; 

Fig. 11 is a flowchart detailing the SORT PEF EXPORT 
20 SYMBOLS step of Fig. 9; 

Fig. 12 is a flowchart detailing the CONVERT 
RELOCATIONS TO PEF RELOCATION INSTRUCTI&NS step of 
Fig. 9; 

Fig. 13 is a flowchart detailing the DO SYMR, SYMB, 
25 LSYM INSTRUCTIONS step of Fig. 12; 

Fig. 14 is a flowchart detailing the DO DDATA, CODE, 
DATA INSTRUCTIONS step of Fig. 12; and 

Fig. 15 is a flowchart detailing the DO DESC 
INSTRUCTIONS step of Fig. 12. 

30 

DETAILED DESCRIPTION 
Fig. 1 is a symbolic block diagram of a computer 
system in which the invention is implemented. It 
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comprises a CPU 102 which is coupled to a bus 104. The 
bus 104 is also coupled to a memory 106 and a disk drive 
108 . Numerous types of computer systems may be used, . 
but in the present embodiment, computer software is 
5 usually stored on the disk drive 108 and brought into 
memory 106 over the bus 104 in response to commands 
issued over bus 104 by the CPU 102. The CPU 102 runs an 
operating system which, possibly together with 
specialized hardware, supports virtual addressing. That 

10 is, a computer program running on CPU 102 can reference 
memory addresses in an address space which is much 
larger than the physical space in memory 106, with the 
operating system and/or specialized hardware swapping 
pages from memory 106 to and from the disk drive 108 as 

15 necessary. The operating system also supports multi- 
tasking, in which two or more tasks can execute 
different copies of the same or different software in a 
parallel or time- sliced manner. 

Executable obj ect code files stored on the disk 

20 drive 108 are referred to herein as containers, and 
those that incorporate features of the presently 
described embodiment are referred to as PEF containers 
(that is, they follow a "PEF" format) . Such files may 
also be retrieved from a different part of memory 106, 

25 or from another resource (not shown) . A PEF container 
may contain the compiler output from a single source 
code module or, if the compiler is able to combine more 
than one source code m.odule, the output from a plurality 
of source code modules. Several different containers 

30 (PEF or otherwise) may be required to form a complete 
program. 

OVERALL PEF CONTAINER STRUCTURE 
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Each PEF container contains the following portions: 
a container header, N section headers, a string table, 
and N section data portions. The container header is of- 
a fixed size and contains global information about the 
5 structure of the PEF container. The exact contents of 
the container header are unimportant for an 
understanding of the present invention and are therefore 
not described herein. 

The section headers portion of a PEF container 

10 includes a section header for each of N "sections". 
Every PEF section, regardless of whether it includes any 
raw data, has a corresponding section header. Such 
sections include, for example, loadable sections such as 
code, data and pattern initialized data (pidata) 

15 sections, and nonloadable sections such as a "loader" 
section. Each of the sections has a corresponding 
section number which is derived from the order of the 
section headers in the section header portion of the PEF 
container, starting with section 0. Section 0 is 

20 typically a code section and section 1 is typically a 
data section, but this is not a requirement. 

Each section header is 28 bytes long and has the 
following structure : 

TABLE III 

25 No. of 

Bytes Description 

4 Section name (offset into global string 

table) . 

4 Desired section address in memory (tvpicallv 

0) . 

4 Execution size in bytes. 

4 Initialization size in bytes (prior to any 

zero- initialized extension). 

4 Raw size in bytes . 
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Offset from beginning of container to this 
section's raw data. 

Region kind (code section, data section, 
pidata section, loader section, etc.). 

Memory alignment required for this section 
(byte alignment, half-word alignment, full- 
word alignment, etc.). 

Sharing kind (unsharable section, sharable 
within a context; sharable across all context 
within a task; sharable across all contexts 
within a team; or sharable across all 
contexts . 

Reserved. 



a 



10 



15 



20 



The exact meaning of each of the elements in 
section header is unnecessary for an understanding of 
the present invention, except as included in the table 
above, and except to note that the section kinds include 
the following sections: 

TABLE IV 
Kind Description 

code A code section is loadable and contains 

read-only executable object code in binary 
format , in an uncompressed, directly 
executable format. 



data A data section is loadable and contains 

initialized read/write data followed by 
read/write zero- initialized data. The 
presence of zero- initialized read/write 
data is indicated by the section size 
being larger than the image size. A PEF 
data section combines the .data and .bss 
sections of a standard UNIX executable 
object code file. Data sections are not 
^0 compressed. 

pidata A Pidata section is loadable and 
identifies a read/write data region 
initialized by a pattern specification 
contained in the section's raw data in the 
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Object file. In other words, the raw data 
portion of a PEF container which 
corresponds to a pidata header contains a 
small program that is interpreted by the 
5 loader to determine how the section of 

memory should be initialized. Zero 
extension is automatically handled by the 
loader for pattern- initialized data, as 
with the data section. The format of a 
10 pidata section is described below. 

constant A constant section is loadable and 
contains read-only data. The loader can 
place it in read-only memory, and a 
15 constant section is not compressed. 

loader The loader section is nonloadable and 
contains information about imports , 
exports and entry points . The format of 
20 the loader section is discussed below. 

debug The debug section is nonloadable and 

contains all information necessary for 
symbolic debugging of the container. 
25 Multiple formats can be supported, 

including the format of a conventional 
XCOFF -debug section. 



3 0 The PEF container format includes no region kind 

specifically for zero- initialized data, similar to a 
.bss section in a conventional XCOFF file. Rather, 
zero- initialized sections are achieved in PEF simply by 
specifying a data section with an init size of 0. The 

35 loader uses the exec size to determine how much space to 
allocate to the section in memory, and then uses the 
difference between the exec size and the init size to 
determine how much space to zero- initialize . Thus the 
effect of a data section with a 0 init size is to zero- 

40 initialize the entire data section in memory. Note 
there is no requirement that zero- initialized data 
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sections immediately follow other data sections in 
memory . 

The section data portion of the PEF container 
contains raw section data corresponding to each of the 
5 headers in the section header portion of the PEF 
container (except for zero- initialized sections) . The 
sequence of section data portions need not be the same 
as the sequence of their corresponding section headers. 
The raw data within certain sections including code, 
10 data and loader sections are required to be 8-byte 
(double- word) aligned. For code sections, the raw data 
simply contains the executable object code. For data 
sections, the raw data contains the initialized data for 
the section. For the loader section, the raw data 
15 contains additional information needed by the loader as 
set forth in more detail below. 
Loader Section Format 

The loader section contains the following portions: 
Loader section header 
20 Table of Import Container ID's 

Import Symbol Table 
Relocation headers 
Relocation instructions 
Loader String Table 
25 Export Slot Table 

Export Chain Table 
Export Symbol Table. 
Loader Section Header . The loader section header 
.has a fixed size, and provides information about the 
30 structure and relative location of the various 
components of the loader section. Most of the entries 
in the loader section header are unimportant for an 
understanding of the present invention, except to note 
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that it contains, ainong other things, one entry 
specifying the number of entries present in the Table of 
Import Container ID's, one entry indicating the niimbel: 
of entries in the "import symbol table" portion of the 
loader section, one entry indicating the number of 
sections in the PEF container which require relocations, 
one entry indicating the byte offset from the start of 
the loader section to the start of the relocation 
instructions (Relocation Table) , one entry indicating an 
offset to the Loader String Table, one entry indicating 
an offset to the Export Slot Table, a "hash slot count" 
{indicating the number of "slots" in the Export Slot 
Table) , and an entry indicating the number of symbols 
which are exported from this object file (including re- 
15 exports of imports) . All tables use zero-based indices. 

Tabl e of Import Container ID^s . The Table of Import 
Container ID's contains an entiry for each external 
container which may contain a symbol to be imported into 
the present PEF container. Such external containers may 
20 follow a conventional format such as XCOFF, or 
preferably follow the same format as described herein 
for a PEF container.- Each entry in the Table of Import 
Container ID's contains, among other information which 
is not important for an understanding of the present 
25 invention, the following fields: an offset into the 
Loader String Table to indicate the name of the 
container, a count of a number of symbols which are 
contained in the Import Symbol Table for the particular 
external container, and a zero-based index of the first 
entry in the Import Symbol Table (in the present file) 
for the particular external container. 

Import — Symbol Table . The Import Symbol Table 

contains one entry for each external symbol to be 



30 
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imported. Each Import Symbol Table entry contains a 
symbol class -identifier together with an offset into the 
Loader String Table to indicate the name of the symbol.- 
All of the symbols to be imported from a given one of 
the external containers are grouped together in the 
Import Symbol Table, although within that group they may 
appear in any order. Imported symbols may be in any of 
the following classes: an address in code, an address in 
data, a pointer to a transition vector (also referred to 
herein as a descriptor or procedure pointer) , a TOC 
symbol, or a linker inserted glue symbol. 

Relocation Headers . The Relocation Headers portion 
of the loader section of a PEF container contains one 
entry for each loadable section of the PEF container 
which includes one or more information items to be 
relocated. As used herein, an "information item" may be 
a word, a double word, or any other unit of information 
which contains one or more relocatable address fields. 
Unlike the conventional file format, in which each 
relocatable information item consisted of exactly one 
relocatable address field, a relocatable information 
item in PEF may contain any number of relocatable 
address fields as defined by the relocation instruction. 

Note that although relocations of information items 
which are contained within a code section are supported 
by the PEF container definition, code sections 
preferably do not include any load time relocations at 
all. This permits code sections to be located outside 
of user-writable memory, and also permits code sections 
to be sharable. Note also that though multiple data 
sections are supported, typically only one data section 
will exist. 
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Each header in che Relocation Headers portion of the 
loader section of a PEF container has the following 
format : 

TABLE V 

5 No. of 

Bytes Description 

4 Section number. 

^0 4 Number of bytes of relocation instructions 

for this section. 

4 Offset from the start of the Relocation 

Table to the first relocation instruction 
1^ for this section. 

The section number field of a relocation header 
identifies the section number of a loadable section 
whose information items are to be relocated. Section 

2 0 number 0 may, for example, be a code section, and 
section number l may, for example, be a data section. 
As set forth above, the section numbers are assigned 
depending on their sequence in the "Section Headers" 
portion of the PEF container. Also as set forth above, 

25 no relocation header will typically be included for a 
code section since code sections preferably contain no 
relocatable information items. Additionally, section 
numbers -1 and -2 have special meanings which are not 
here relevant; these sections do not contain 

30 relocations. Section number -3 indicates a re-export- 
of- import, and does not contain any relocations. 

Relocation Table. The Relocation Table in the 

loader section of a PEF container consists of an array 
of relocation instructions. All of the relocation 

35 instructions for information items within a given 
section of the PEF container are grouped together. The 
relocation instructions are two or four bytes in length 
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15 



20 



25 



and contain an opcode in the high -order bits. The 
remaining fields of the instruction depend upon the 
opcode and are described hereinafter. 



contains all of the strings referenced by other portions 
of the loader section. The strings are all variable in 
length, and include both import and export symbol names 
and file names of imported libraries. The loader string 
table is located in the middle of the loader section, 
after the import symbol table and relocation table, but 
before the symbol export portions of the loader section. 
All of the import symbols are grouped together and 
located nearest the import symbol table, and all of the 
export symbols are grouped together and located nearest 
the export symbol table. Both the grouping of the 
symbols and their locations relative to the two symbol 
tables tend to improve performance on loading in a 
virtual memory computer system, since page hits are more 
likely . 

Symbol Export Portions . The PEF container 

organization hashes a container's export symbols into a 
hash data, structure at container build time. This 
considerably speeds the search for exported symbols when 
they are to be imported into another container at load 
time. The symbol export portions of the loader section 
of a PEF container include an Export Slot Table, an 
Export Chain Table, and an Export Symbol Table. 

In the hash data structure, the Export Slot Table 
contains an entry for each hash "slot" in the data 
structure. Each entry contains a chain count indicating 
the number of export symbols which have been placed in 
the slot corresponding to the entry's index, and an 



Loader String Table . 



The loader string table 
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index into the Export Chain Table of the first Export 
Chain Table entry for the slot. 

The Export Chain Table contains an entry for each 
export symbol, grouped according to their slots. Each 
5 entry contains the hashword value for the entry's export 
symbol . 

The Export Symbol Table contains an entry for each 
symbol in the current PEF container which may be 
imported by another container. These entries include an 

10 offset into the Loader String Table of the current 
container to identify the name of the symbol, the number 
of the section in the current PEF container which 
contains the symbol, and the offset into that section at 
which the symbol may be found. 

15 Thus to locate an export symbol given its name, the 

loader program first computes the hashword value for the 
symbol name using a predefined Name- to- Hash- Word 
function. It then uses a predefined Hash- Word- to-Hash- 
Slot -Number function, together with the size of the 

2 0 Export Slot Table in the given container (obtained form 

the Loader Section header) to compute the slot number 
for the export symbol. It uses the hash slot number as 
an index into the Export Slot Table, from where it 
fetches the chain count and the Export Chain Table index 
25 of the first Export Chain Table entry for the slot. 
While the count is valid, the Export Chain Table is 
searched starting at the Export Chain Table index 
previously fetched. When a hashword matches that 
computed for the desired export symbol, the Export Chain 

3 0 Table index of that hashword designates the export index 

number of a potential name match. To see if the name 
really matches, the loader program applies the export 
index to the Export Symbol Table, which contains the 
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offset to the export name ' s text , which is compared to 
the desired 'export symbol name. If it matches, then the 
information about the export identified by that index i^ 
returned. Otherwise, as long as the count is still 
5 valid (if not, symbol is not found) , the next export 
index is checked. 



information about an export symbol by (zero based) 
10 index, the PEF Loader applies the index to the Export 
Chain Table to fetch the hashword value for that symbol, 
or applies the index to the Export Symbol Table to fetch 
the rest of the information about the value. The Export 
Symbol Table contains a pointer to the actual name text. 
15 The general process for construction of the Export 

Slot Table and the Export Chain Table in the presently 
described embodiment is as follows: 



A PEF loader program can also locate an export 
symbol given its export symbol index. To find 



20 



1 . 



Compute the number of elements in the Export 
Slot Table. This is based upon the number of 
exports, but can vary depending on desired 
Export Slot Table size overhead. Write the 
Export Slot Table size into the Loader Section 
header . 



25 



2 . 



For each export, compute its hashword value 
using the Name- to -Hash -Word function. Then 
compute its export hash slot value based upon 
the Export Slot Table size, using the Hash-Word- 
to-Hash- Slot -Number function . 



30 



35 



3 . 



Sort the exports by hash slot number . This 
order groups together all exports with the same 
slot number, and creates the (zero based) symbol 
index for each export . At each index in the 
Export Chain Table, write the hash word for the 
export . Ac each index in the Export Symbol 
Table, write the symbol table entry. 



40 



4 . 



Construct the Export Slot Table (size given by 
step 1) . Each entry in this table has the count 
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Of exports, as well as the index of the first 
export whose hash slot values collide at that 
hash slot index. 

5 The following is a preferred method for computing an 

Export Slot Table size for use in the hashing process, 
although other methods could be used instead. The hash 
slot table size function computes an appropriate size 
for the Export Slot Table, given the number of exports. 
10 The size is always a power of 2, and therefore the 
output of this function is to give the number of the 
power of 2. The constant kAvdChainSize is normally 5, 
but can vary as desired for a size vs. speed trade off. 
Note: there is a very small penalty for longer hash slot 
15 chains, which result from shorter hash slot tables, 
because the data structures are optimized to account for 
hash slot collisions, 
int NumSlotBits { long exportCount ) 
register int i; 
for (i = 0; i < 13; i++) { 

^ if ( exportCount / (1 << i) < }cAvgChainSize ) break; 

if (i < 10) return i+1; 
return i; 
} /* NuxnSlotBits () */ 

30 The preferred Name- to -Hash -Word function used to 

compute the hashword for export symbols in the loader 
section, encodes the length of the identifier as the 
high order 16 bits, and the low order bits are 
constructed by accumulating for each byte the following 

3 5 operation: rotate the current hash value 1 bit, and xor 
in the next byte of the name. It is implemented as 
follows : 

#define ROTL(x) ( ( (x) << l ) - ( (x) >> (16) ) ) 



20 



25 



40 



/* produce hash value for id given length (length == 0 --> null 
terminated) */ 
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\insigned long Hash (register unsigned char *naine , register int 
length) 

{ 

register long hash = 0; 
5 register int len = 0; 

while (*naine) { 

hash = ROTL ( hash ) ; 
hash *= *naine++ ; 
10 len++; 

if (--length == 0) break; 

} 

return (unsigned short) (hash (hash >> 16) ) + (len << 16) ; 
} /* Hash 0 */ 

15 

The Hash-Word- to-Hash-Slot-Number function converts 

a hash value as computed above, into an appropriately 

small index number. The preferred slot function is 

based upon the size of the Export Slot Table, also 

20 computed as above. 

^^define HashSlot (h,S,M) ( ( (h) " { (h) >> (S) ) ) & (M) ) 
hashSlotlndex = HashSlot ( hashWord32, htShift, (1 << htShift) - 1 ) 

Pidata Section Format 

25 The raw data of a pidata section is organized as a 

sequence of data expansion instructions Each 
instruction begins with a 3 -bit opcode followed by a 5- 
bit count, together in the first byte. Then depending 
on the opcode, additional counts and/or raw data bytes 

30 may follow. The opcodes are as follows: 



Opcode 0: Zero #Count 

This opcode takes one count parameter. It clears Count bytes starting at the current data 
3 5 location. 

Opcode 1: Block #Count (Count # of raw data b\tes) 

This opcode takes one count parameter, and count number of raw data bytes. It deposits 
40 the raw data bytes into the data location, essentially performing a block byte transfer. 

Opcode 2: Repeat ffCountl, MCount 2 (Count I ft of raw data bytes) 
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20 



This opcode takes two count parameters, and Countl number of raw data bytes It 
repeats the Countl number of raw data bytes RCount2 times. 

^ Opcode 3: RepeatBlock #Count I, #DCount2. #RCount3 (raw data bytes) 

This opcode takes three count parametres and Countl + DCountl * RCountS number 
of raw data bytes. The raw data bytes are interpreted as follows: the first Countl bytes 
make up the "repeating pattern". The next DCount bytes make up the first "non- 
repeatmg part" . There are RCountS number of "non-repeating pan"s. The opcode first 
10 places the "repeating pattern", then one "non-repeating part". This is performed 
RCountS times, each time using die same "repeating pattern" and a new "non-repeating 
part . Lastly, one more "repeating pattern" is placed. 

Opcode 4: Repeat Zero itCountl. UDCountl, RCountS (raw data bytes) 

This opcode takes three count parameters and DCountl * RCountS number of raw data 
bytes The raw data bytes are interpreted as follows: The "repeating pattern" is made 
up of Countl bytes that are not represented in the raw data bytes. The first DCount 
bytes make up the first "non-repeating part". There are RCountS number of "non- 
repeating part"s. The opcode first places the "repeating pattern" (zeroes), then one 
non-repeating part". This is performed RCountS times, each time using the same 
repeatmg pattern" (zeroes) and a new "non-repeating part". Lastly, one more 
repeating pattern" (zeroes) is placed. 

LOADER PROGRAM 

Fig. 2 is a simplified flow chart illustratiing broad 
functions of a loader program which may employ the 
present invention. The LOAD CONTAINER routine 2 02 
illustrated in the figure is recursive as hereinafter 
described. 

In a step 204, the entire PEF container then being 
loaded is brought from the source of the container (disk 
drive 108, another portion of memory 106, or a resource 
(not shown) ) into a portion of memory 106 which is not 
writable by a user program. If the desired container is 
already present in memory 106, having been previously 
loaded by another task, then this step 204 is skipped. 

Fig- 3 is a symbolic diagram of memory array 106 for 
an illustration of the LOAD CONTAINER routine for a 
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situation in which the primary PEF container is an 
application * program, and in which the application 
program includes calls to external symbols which are all 
located within a second PEF container referred to herein 
5 as a library container. As shown in Fig. 3, the 
application program is loaded in step 204 into a region 
302 of the read-only portion of memory 106. The 
application program includes (among other sections not 
shown) a header section loaded into a memory region 304, 

10 a code section loaded into a memory region 306, a data 
section loaded into a memory region 308, and a loader 
section loaded into a memory region 310. 

Referring again to Fig. 2, after the container is 
loaded into memory 106, the LOAD CONTAINER routine 

15 allocates and fills read/write memory regions for all 
the data and pidata sections (step 206) . For the 
illustration of Fig. 3, this step involves copying the 
data section from region 308 of memory 106 into a region 
312 in a read/write part of memory 106. The regions to 

20 be allocated in this step are identified in the header 
section 3 04 of the application program, and the data to 
be copied or otherwise written into the allocated memory 
regions derive from the corresponding section data 
portions of the application program. 

25 Also in step 206, a regions [i] table is created to 

indicate the virtual start address for each region i 
created for the present container. Specifically, if the 
present container includes only one code section, and it 
is identified in the Section Headers portion of the 

30 container as a section 0, then regions [0] will carry the 
virtual address in read-only memory at which the code 
section was placed. In the example of Fig. 3, 

regions [0] will carry the virtual address of the start 
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of memory region 306. Similarly, if section 1 of the 
container i^ a data section, then the table entry 
regions [1] will carry the virtual address of the data- 
section in memory 106 as copied for use by the current 
5 task. In the example of Fig. 3, the table entry for 
regions [1] would carry the virtual address of memory 106 
region 312. As mentioned, multiple code and data 
sections are supported and may occur in any order. Each 
is given a number, and the regions [i] table indicates 

10 where each section is located in memory 106. 

Referring again to Fig. 2, after the regions [i] 
table is created, all references to external symbols are 
resolved in a step 208 and in a manner hereinafter 
described. (Note that in the present embodiment, though 

15 not important for an understanding of the present 
invention, certain external symbol imports need not be 
resolved at this time if they carry a flag indicating 
that the import need not be resolved.) The result of 
the RESOLVE IMPORTS step 208 is an imports [j] table 

2 0 which carries the virtual address of each of the symbols 

j which are referenced in the present container. The 
RESOLVE IMPORTS step 208 is described in more detail 
below. 

After the imports are resolved, a decision is made 
25 in a step 212 as to whether any relocation headers are 
present in the Relocation Headers portion of the loader 
section of the PEF container. If not, then the user 
program is now fully loaded into memory 106 and the LOAD 
CONTAINER routine 202 is complete. The user program may 

3 0 be executed at this time. If the Relocation Headers 

portion of the loader section does contain one or more 
relocation headers, then the loader executes a PERFORM 
RELOCATIONS step 210 on the section indicated by the 
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next relocation header. As mentioned above, there is no 
requirement ' that the sequence in which sections are 
relocated be the same as the sequence of the sections in 
the PEF container, nor is there any requirement that 

5 sections be relocated in numerical order. After the 
relocations are performed on all the information items 
within the section identified in the current relocation 
header, the index into the relocation headers is 
incremented in a step 216, and the routine returns to 

0 the decision step 212 in order to determine whether any 
further Relocation Headers remain in the Relocation 
Headers portion of the loader section of the current PEF 



sections have been processed. 

15 S\Tmbol Import Resolution 

Fig. 4 is a flow chart detailing the RESOLVE IMPORTS 
step 208 (Fig. 2) . Generally described, the RESOLVE 
IMPORTS routine 2 08 loops through all of the Import 
Container ID's in the Table of Import Container ID'S in 

20 the Loader section header of the current PEF container. 
For each such Import Container ID, the RESOLVE IMPORTS 
routine 208 recursively calls the LOAD CONTAINER routine 
to bring the referenced container into memory and to 
recursively resolve the imports specified therein. Once 

25 control returns to the RESOLVE IMPORTS routine for the 
present container (referred to herein as the "importing 
container"), the routine cycles through all of the 
entries in the Import Symbol Table in the loader section 
of the importing container. For each such entry, the 

3 0 routine compares the name of the imported symbol to 
various ones of the strings identified in the Export 
Symbol Table of the container most recently loaded 
(referred to herein as the "imported container") until 



container . 



This process continues until all of such 
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a match is found. Such comparisons are facilitated by 
the hashed organization of PEF export symbol tables. 
The virtual address of the imported symbol is then- 
calculated from the corresponding entry of the Export 
5 Symbol Table of the imported container and stored in the 
imports [j] table. 

Accordingly, the RESOLVE IMPORTS routine 208 begins 
with a step 402 in which it is determined whether any 
entries remain in the Table of Import Container ID'S for 

10 the present container. If not, then the RESOLVE IMPORTS 
routine 208 is finished and the routine is exited in a 
step 404. If entries do remain in the Table of Import 
Container ID'S, the LOAD CONTAINER routine is 
recursively invoked, in a step 406, for the next 

15 referenced container. As previously mentioned, 

containers need not all follow the PEF format, and if 
the next specified container is not in the PEF format, 
then another appropriate routine is invoked instead of 
LOAD CONTAINER. 

2 0 After control is returned to the RESOLVE IMPORTS 

routine 208 for the present container, a determination 
is made whether the Loader Import Symbol Table in the 
loader section of the present container includes any 
further symbols to be imported from the container most 

25 recently loaded (step 408) . If not, then control is 
returned to step 402 to determine whether the present 
container requires loading of any further external 
containers . 

In the illustration of Fig. 3, recursive execution 
30 of the LOAD CONTAINER routine 202 results in the loading 
of the library container into a region 314 of the read- 
only part of memory 106. The container header of the 
library container is placed in a region 316, the code 
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section is placed in a region 318, the data section is 
placed in a region 320, and the loader section for the 
library container is placed in a region 322. The data* 
section 320 is also copied to region 324 in the 
read/write part of memory 106 for the present task. A 
separate regions [i] table is created for the library 
container and used as hereinafter described to perform 
relocations in the library container and, after 
processing resumes for the application program, to 
calculate the virtual addresses of symbols imported from 
the library container. 

Returning to the flow chart of Fig, 4, after it is 
determined that additional symbols remain to be imported 
from the most recently loaded container, the RESOLVE 
IMPORTS routine 2 08 finds the next import symbol name 
(from the Import Symbol Table of the present container) 
in the Export Symbol Table of the container most 
recently loaded (step 410) . This is accomplished by the 
steps illustrated in Fig. 8. 

Referring to Fig. 8, the step 410 begins by 
computing the hashword value for the desired import 
symbol name using the predefined Name- to -Hash -Word 
function (step 802) . In a step 804, the routine uses 
the predefined Hash -Word -To -Hash -Slot - Number function, 
together with the size of the Export Slot Table, to 
compute the export slot number where the import symbol 
should be found. In step 806, the routine uses the hash 
slot number as an index into the Export Slot Table, from 
where it fetches the chain count and the Export Chain 
Table index of the first Export Chain Table entry for 
the slot. In a step 808, the routine loops through the 
Export Chain Table, starting from the index previously 
fetched, for up to the number of entries designated by 
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the chain count. For each entry in the Export Chain 
Table, the routine determines whether the hashword for 
that entry matches that computed for the desired import 
symbol (step 810) . If not, then the loop continues 
(808) . If so, then the Export Chain Table index of that 
hashword is applied Export S^nmbol Table to obtain the 
offset to the export name's text in the Loader String 
Table (step 812), and it is determined whether the 
corresponding Loader String Table entry matches the 
desired import symbol name (step 814) . If not, then the 
loop continues (step 808) , and if so, then the import 
has been found in the export symbol table and the 
routine returns to the caller (step 816) . 

Thus the completion of step 410 results in an index 
to the referenced entry in the Export Symbol Table of 
the most recently loaded container. In a step 412, the 
information in that Export Symbol Table entry is used to 
calculate the virtual address of the symbol to be 
imported, which is then stored in the imports [j] table 
for the importing container. In particular, the virtual 
address of the imported symbol is calculated as the 
virtual address of the section in the imported container 
which contains the imported symbol, plus the offset into 
that section where the symbol is located. The virtual 
address of the section in the imported container which 
contains the imported symbol is taken from the 
regions [i] table for the imported container, and both i 
and the offset are taken from the imported container's 
Export Symbol Table entry which has been identified for 
the symbol to be imported. The symbol number j is the 
symbol number as seen by the present, or importing 
container . 
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Note that if the section number in the Export Symbol 
Table entry for the desired symbol contains the special 
section number -2, then the virtual address is taken^ 
directly from the offset field of that entry. If the 
5 section number is -3 (indicating a re-export of a symbol 
imported into the library container from yet a third 
container) , then the offset is used as an index into the 
library's import symbol table. 

After the virtual address of the present symbol is 

10 calculated and stored in imports [j], j is incremented in 
a step 414 and control is returned to the decision step 
408 to determine whether any further symbols need to be 
imported from the most recently loaded container. 
Relocatincr References 

15 Fig. 5 is a flow chart detailing the PERFORM 

RELOCATIONS step 210 in Fig. 2. As mentioned, the 
PERFORM RELOCATIONS routine is invoked for each of the 
loadable sections i in the current PEF container, in the 
sequence specified in sequential relocation headers in 

2 0 the current PEF container. The current section number 
i is passed as a parameter to PERFORM RELOCATIONS 
routine. A C- language procedure which includes the 
PERFORM RELOCATIONS routine 210 is set out in 
Appendix A. 

25 Referring to Fig. 5, the routine first checks for 

errors in a step 502, and aborts if an error is 
detected. Such errors can include the receipt of a 
section number which is less than zero or greater than 
the total number of sections in the current container. 

30 Another error might be that the section number provided 
by the calling routine is a constant section or the 
loader section, neither of which contain any relocatable 
items. 
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After the initial error checking step 502, several 
variables are initialized in a step 504. As will be 
seen, the individual elements of the Relocation Table 
are interpreted at a much higher level in the present 
5 embodiment than would relocation table entries in a 
conventional relocation table. This interpretation is 
performed in the present embodiment entirely in 
software, by routines which simulate decoding and 
execution of the relocation instructions in the 
10 Relocation Table. Thus the interpretation software 
forms a "pseudo machine" or "abstract machine" for 
interpreting and executing the relocation instructions. 

Accordingly, the following abstract machine 
variables are initialized in the step 504: 

15 reloc Pointer to the next relocation instruction 

in the Relocation Table of the loader 
section of the current PEF container. 
Initialized to point to the first 
relocation instruction for the current 

2 0 section number as passed to the PER.FORM 

RELOCATIONS routine 210. 

rlEnd Pointer to the first byte after the last 

relocation instruction for the current 

25 section number as passed to the PERFORM 

RELOCATIONS routine 210. Initialized to 
that value by adding, to reloc, the number 
of bytes of relocation instruction as 
specified in the relocation header for the 

30 current section. 

rAddr Pointer to the next information item in 

the raw data for the current section. 
Initialized to the memory address of the 
35 beginning of the memory region for the 

current section number. 

rSyml A symbol index (count value) which is used 

to access a corresponding import's address 
40 in the imports [i] table. Initialized to 

the value 0 for each new section. 
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Pointer to the starting memory address of 
one of the loadable sections in the 
current PEF container. Initialized to 
point to the starting memory address of 
section number 0 if section number 0 is 
present and loadable , otherwise it is 
initialized to the value 0 . As will be 
seen, certain relocation instructions can 
change the value of codeA, thereby 
affecting subsequent relocations by 
relocation instructions that refer to this 
variable . 

Pointer to the memory address of another 
one of the loadable sections in the 
current PEF container. Initialized to 
point to the starting memory address of 
section number 1 if section number 1 is 
present and loadable, otherwise 
initialized to the value 0. As with code 
A, certain relocation instructions can 
change the value of dataA, thereby 
affecting subsequent relocations by 
instructions that refer to this variable. 

A counter variable used in the repeat 
instructions, described below,. 
Initialized to the value 0. 

3 0 The abstract machine variables codeA and dataA do 

not attempt to define which section is a code section 
and which section is a data section in the container, 
except to supply initial default values for these 
variables. The variables can point to any of the 

35 container's loadable sections, and can be changed as 
often as desired through the use of the relocation 
instructions . 

Note that although codeA and dataA are described 
above as pointing to the memory address of a loaded 

40 section, they do so by storing a value equal to the 
memory address of the loaded section minus the default 
address for that section . The default address for a 
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section is obtained from the desired Section Address 
field in that section's Section Header. The default 
address for the various sections are typically 0,. 
however, so the value contained by the codeA and dataA 
5 variables will typically equal the starting address of 
the desired section in memory. 

After the abstract machine variables are 
initialized, a decision is made as to whether there are 
any (further) relocation instructions corresponding to 

10 information items within the current section (step 506) . 
This decision is made by testing whether reloc is less 
than rlEnd. If not, then the relocations are complete 
for the current section and PERFORM RELOCATIONS routine 
210 is exited (step 508) . 

15 If more relocation instructions exist for the 

present section, then the next relocation instruction is 
retrieved and copied into a variable r (step 510) . 
(Note that in another embodiment, the relocation 
instruction may be retrieved, for example as part of a 

20 step which tests its contents, without actually copying 
it into a variable for subsequent reference.) The 
variable reloc is then incremented by two bytes in a 
step 512. As will be seen, most of the relocation 
instructions are two bytes in length, although a few of 

25 them are four bytes in length. In step 510, only the 
first two bytes are copied into r. After step 510, 
reloc points to the next relocation instruction if the 
current one is only 2 bytes in length, or to the second 
, half-word of the current relocation instruction if it is 

30 4 bytes in length. 

All of the instruction foirmats begin with an opcode 
in the high-order bits. Referring to Fig. 5, the opcode 
is used in a "switch" step 514 to pass control to one of 
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a plurality of routines depending upon the contents of 
the opcode portion of the relocation instruction now in 



information items to be relocated are assumed to contain 
■5 one or more relocatable references, each in the form of 
a 4 -byte virtual address field, and relocation is 
accomplished by adding the contents of some specified 
pointer (a pointer to the start of a region in memory or 
a pointer taken from an import) to the relocatable 

10 reference . 

After completion of the particular routine called 
for by the opcode in the retrieved relocation 
instruction, control returns to the decision step 506 in 
order to determine whether any further relocation 

15 instructions have been provided for the present section. 
If so, then the next relocation instruction is retrieved 
and the loop continues. If not, then the PERFORM 
RELOCATIONS routine 210 is exited. 

2 0 RELOCATION INSTRUCTIONS 

Relocation instructions in the present embodiment 
are objects that are packed as opcode and arguments. 
Some of the relocation instructions act as directives 
for the abstract machine itself , while others instruct 

25 that a relocation needs to be performed, and others 
instruct that many relocations of a particular type are 
to be performed. The various field definitions for the 
different types of instructions are fully set forth in 
the following C language typedef . 



the variable r . 



In the present embodiment, all 
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typedef union { 

struct f unsigned op: 7, rest:9; 

struct f lihsigned op: 2, delta_d4:8, cnt : 6 ; 

struct ( unsigned op : 7 , cnt_ml : 9 ; 

struct ( unsigned op : 7 , idx:9; 

struct unsigned op : 4 , delta_ml:12; 

struct i unsigned op : 4 , icnt_ml:4, rcnt__ml : 8 , 

struct •[ unsigned op : 6 , id3c__top : 10 ; 

struct { unsigned op : 6 , cnt_ml : 4 , idx_top : 6 ; 

Reloclnstr instr; 

TUnsignedlS bot ; 
} Relocation; 



opcode ; 

deltadata; 

run; 

gip; 
delta; 
rpt ; 
largel ; 
large 2 ; 



15 



20 



25 



30 



35 



Before proceeding, it will be useful to understand 
the structure of a data section 600 generated by a 
typical compiler. Such a structure is illustrated in 
Fig: 6. Grouped at the start of the data section 600 is 
initialized user data in a portion 602. Following the 
initialized user data portion 602, compilers typically 
place a group of contiguous transition vectors (also 
known as procedure descriptors) in a region 604. 
Transition vectors are used by external callers to 
procedures which are referenced by the transition 
vectors. Thus transition vectors are common especially 
in executable object code files produced on compilation 
of libraries. The compiler typically creates a 

transition vector for each procedure compiled, although 
by the time relocations are being performed by a loader 
program, the transition vectors for procedures which are 
never called by external callers typically have been 
eliminated. 

Transition vectors are usually created in one of two 
formats, indicated in Fig. 6 as formats 606 and 608, 
respectively. A given compiler typically produces 
transition vectors in only one of the two formats 606 or 
608. In format 606, each transition vector contains 
three 4-byte words. The first word is a pointer 610 to 
the start of a procedure in the code section, and the 
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second word 612 is a pointer to a known location in the 
TOC for the- data section 600. The third word 614 
contains an "environment word" which is rarely used and. 
is not important for an understanding of the present 
5- invention. As created by the compiler, the pointer 610 
contains an offset into the code section for the desired 
procedure and pointer 612 contains an offset into the 
data section 600 usually for the start of the TOC. Thus 
the first word of each transition vector 606 must be 
10 relocated by the loader program through the addition of 
the start address of the code section in the current 
container. This is referred to as a "code- type 
relocation". Similarly, the second word of each 
transition vector 606 must be relocated by the loader 
15 program through the addition of the start address of a 
data section, typically the same data section 600 which 
contains the transition vector 606. This is known as a 
"data- type relocation". The environment word 614 in 
each of the transition vectors 606 does not need to be 
20 relocated. 

Transition vectors of the second type 608 are the 
same as those of the first type 606, except that the 
third word of each transition vector is omitted. 

Following the transition vectors 604, compilers 
25 typically place a table of pointers, also known as a 
table of contents (TOC) in a portion 616 of the data 
section 600. The TOC contains various pointers 618 into 
the code section, pointers 620 into the data section 
600, and pointers 622 to external symbols which are to 
be imported. These pointers are typically intermixed, 
although it is frequent that several of one type are 
grouped together. As produced by a compiler, pointers 
into the code section carry only the offset into the 



30 
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code section and pointers into the data section carry 
only the offset into the data section. These types of 
pointers 618 and 620 therefore require code and data-^ 
type relocations, respectively. Pointers 622 to 

5 external symbols typically contain 0 upon completion of 
compilation, although they are permitted to contain an 
offset to which the address of the desired symbol will 
be added. The latter address is taken from an 
appropriate entry in the imports [i] table created by the 
10 RESOLVE IMPORTS routine 208 (Fig, 4) . This type of 
relocation is referred to herein as a "symbol -type 
relocation" . 

Considering first the code -type relocations, it can 
be seen that the relocation table entries to accomplish 

15 such relocations in conventional executable object code 
formats require 12 bytes of relocation information for 
each 4 -byte information item to be relocated. This is 
due in part to the fact that conventional format 
requires all of the following information to appear in 

20 each 12 -byte relocation table entry: the section number 
containing the information item to be relocated, the 
offset within that section of the information item to be 
relocated, the section number whose starting address 
must be added to the information item, and an indication 

25 that the information item is a 32-bit word. In the 
present embodiment, on the other hand, most of this 
information either is maintained in one of the abstract 
machine variables, or is inherent in the relocation 
instruction opcode. in particular, the offset in the 

30 current section of the information item to be relocated, 
is maintained in the abstract machine variable rAddr 
together with the section number containing the 
information item to be relocated. Similarly, the number 
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Of different relocatable address fields in the 
information item to be relocated, and the fact that each 
is to be treated as a 32 -bit pointer are inherent in th^' 
relocation instruction opcode, since a separate routine 
5 can be provided to perform the relocations specified by 
each relocation instruction opcode. Additionally, the 
section number whose starting address in memory is to be 
added to the offset value in the relocatable address 
field is also specified inherently by many of the 

10 relocation instruction opcodes. 

Moreover, whereas the conventional executable object 
code file format requires one relocation table entry for 
each address field to be relocated, the loader program 
of the present embodiment takes advantage of the fact 

15 that as mentioned above, several relocations of the same 
type are often grouped together (as in the TOC) . That 
is, a compiler may generate five pointers 618 into the 
code section sequentially, followed by four pointers 620 
into the data section, followed by seven pointers 622 to 

20 external symbols, followed by more pointers 620 into the 
data section, and so on. The loader program of the 
present embodiment takes advantage of this 
characteristic by providing a class of relocation 
instructions which specify a type of relocation (e.g. 

25 code, data, symbol) together with a count of the number 
of such relocations to perform sequentially (i.e. in a 
"run" ) . 

Many of the relocation instructions which 
accommodate a run of similar type relocations follow the 
30 "run" instruction format illustrated in Fig. 7A. In 
this format, the high- order three bits carry an opcode 
of 010, the next four bits carry a subopcode indicating 
the type of relocation to perform, and the low-order 
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nine bits carry a count CNT9 indicating the number of 
consecutive ^ information items to be relocated in the 
manner specified by the subopcode . 

One such relocation instruction type is CODE which, 
5 in a step 516 (Fig. 5) adds the value in abstract 
machine variable codeA to the information item specified 
by the machine variable rAddr for CNT9 consecutive 
information items. rAddr is incremented by four after 
each individual one of the relocations so as to point to 

10 the next information item in the run. Another such 
relocation instruction type is DATA which, in a step 518 
adds the value in abstract machine variable dataA to the 
information item specified by the abstract machine 
variable rAddr for CNT9 consecutive four-byte 

15 information items. 

Another relocation instruction type, SYMR, performs 
a run of symbol -type relocations. For each of the 
symbol -type relocations in the run, the information item 
specified by rAddr is relocated by adding the value from 

20 imports [rSyml] , both rSyml and rAddr being incremented 
by 1 and 4, respectively, after each such relocation 
operation in the run. As mentioned, rSyml is 

initialized to zero at the start of processing a 
section. Unless specifically reset by an SYMB or LSYM 

25 instruction (described below) , rSyml increases 
monotonically as the section is processed . 

It can be seen that the availability of these three 
relocation- type instructions (CODE, DATA and SYMR) 
themselves can drastically reduce the number of bytes of 

30 Relocation Table information which are required to 
perform the required relocations on information items 
contained within a TOC. The DESC and DSC2 relocation 
instruction types can provide an even more drastic 
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reduction. As previously mentioned, in a typical 
executable object code file, all of the transition 
vectors are grouped together in a portion 604 (Fig. 6) 
For transition vectors of type 606, each transition 
5 vector requires a code- type relocation, followed by a 
data-type relocation, followed by no relocation. For 
transition vectors of type 608, each transition vector 
requires a code- type relocation followed by a data- type 
relocation. In either case, the conventional executable 

10 object code format required 24 bytes (two 12 -byte 
relocation table entries) to specify a transition vector 
relocation. In the present embodiment, on the other 
hand, a run- type relocation instruction, referred to 
herein as DESC, is provided for relocating a run of 

15 transition vectors of type 606. The routine is 
implemented in DESC step 522 (Fig, 5) in which, for each 
three-word information item 606 in the run, the value in 
abstract machine variable codeA is added to the word 
pointed to by rAddr and rAddr is post - incremented by 

20 four. The value in abstract machine variable dataA is 
then added to the word pointed to by rAddr, and rAddr is 
post - incremented again by four. rAddr is then 

incremented by four a third time to skip over the 
environment word 614. This procedure is repeated for 

25 each information item in the specified run. 

The DSC2 relocation instruction type is similar to 
the DESC relocation instruction type, except that it is 
adapted for use on transition vectors of type 608. DSC2 
instructions are processed in step 524 by a routine 

30 which is identical co the DESC routine 522 except for 
the omission of the third incrementing of rAddr. 

Accordingly, it can be seen the relocation of an 
entire region 604 of transition vectors can be performed 
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using a loader program of the present embodiment, in 
response to- a single 2-byte relocation instruction. 
This represents a drastic savings over conventional, 
loader programs . 

Within the above framework, numerous types of 
relocation instruction can be developed to simplify 
relocations on other rommonly occurring relocatable 
information item formats as they are recognized. For 
example, one relocatable information item format which 
is frequently produced in the initialized user data area 
602 by C++ compilers is in the form of a one-word 
pointer to an object in the data section, followed by 
one word of nonrelocatable data. Thus in the present 
embodiment, a VTBL relocation instruction type is 
provided and processed in a step 526 which adds the 
value in the abstract machine variable dataA to the 
first word of each information item in a run, and leaves 
the second word of each information item in the run 
unchanged. Many other information item formats can be 
accommodated by other relocation instruction types . 

Certain relocation instruction types modify the 
abstract machine variables without performing any actual 
relocation operation. For example, a DELTA- type 

relocation instruction merely adds a specified value to 
rAddr, thereby effectively skipping over a number of 
information items which are not to be relocated. The 
format of the DELTA instruction is illustrated in Fig. 
7B, and in particular includes a 4 -bit high- order opcode 
of 1000, followed by a 12 -bit DELTA value B12 . DELTA 
instructions are processed in a step 528 (Fig. 5) , which 
merely adds 812 + 1 to the value in abstract machine 
variable rAddr. 
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•The GDIS instruction also modifies the value of an 
abstract machine variable without performing any 
relocations. In particular, it loads the codeA variable 
with the starting memory address of a specified one of 
the loadable sections in the present PEF container. The 
GDIS instruction follows the format of Fig. 7C and 
contains a high- order 3 -bit opcode of Oil, followed by 
a 4 -bit subopcode indicating the GDIS instiruction, 
followed by a 9 -bit IDX9 field for specifying, the 
section number whose' start address is to be loaded into 
codeA. The GDIS instruction is processed in step 530 
(Fig. 5), which merely copies the pointer in 
regions [IDX9] into the abstract machine variable codeA. 

A similar instiruction which follows the same opcode 
format as GDIS is the instruction DTIS, processed in a 
step 532 (Fig. 5) . The step 532 merely copies the value 
in regions [IDX9 ] into the abstract machine variable 
dataA. The GDIS and DTIS instructions are most useful 
when multiple code and/or data sections are produced by 
the compilation system. 

Gertain of the relocation instruction types 
supported in the present embodiment both perform a 
relocation operation and also further modify one of the 
abstract machine variables. For example, the DDAT 
instruction first increments rAddr by a specified number 
of words (similar to DELTA) , and then performs a 
specified number of contiguous data- type relocations 
(similar to DATA) . The DDAT instruction follows the 
format illustrated in Fig. 7D, which calls for a 2 -bit 
high-order opcode of 00, followed by an 8-bit W8 field 
indicating the number of words of information items to 
skip, followed by an N6 field indicating the number of 
data- type relocations to subsequently perform, DDAT 
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instructions are processed in a step 534 (Fig, 5) , in a 
routine which first increments rAddr by four times the 
value in W8 , and then adds the value in dataA to each of. 



incremented after each such relocation operation such 
that when the instruction is completed, rAddr points to 
the next information item after the run. 

SYMB is another instruction which modifies an 
abstract machine variable and then performs a relocation 
operation. The SYMB instruction follows the format of 
Fig. 7C, and causes the abstract machine to first load 
rSyml with a new symbol number specified by IDX9 , and 
then (similarly to the SYMR instruction for a run count 
of 1) add the value in imports [rSyml] to the relocatable 
information item then pointed to by rAddr. Both rSyml 
and rAddr are then incremented. The SYMB instruction is 
processed in a step 536 (Fig. 5) . 

The SECN instruction, which also follows the format 
of Fig. 7C, performs a single relocation to a section 
whose number is specified in IDX9 . In step 538 
(Fig. 5) , the routine for processing SECN instructions 
merely adds the value in regions [IDX9 ] to the 
information item pointed to by rAddr, and increments 
rAddr . 

The RPT instruction permits further compaction of 
the relocation table by providing a means of repeating 
a series of relocation instructions on a specified 
number of consecutive groups of infoirmation items. The 
RPT instruction follows the format of Fig. 7E, which 
includes a high-order 4-bit opcode of 1001, followed by 
a 4 -bit number 14, followed by an 8-bit CNT8 field. 
When invoked, the abstract machine repeats the most 
recent 14 relocation instructions, CNT8 times. The RPT 



the N6 following information items. 
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instruction is processed in a step 540 (Fig. 5) which 
first decrements the abstract machine variable rpt and 
tests for its value. If rpt is now less than 0, this is 
the first time that the RPT instruction was encountered 
5 and rpt is initialized to CNT8 + 1. Then, regardless of 
the value of rpt, the abstract machine variable reloc is 
decremented by the number of half-words specified by 14 
and the routine is exited. This has the effect of 
causing the abstract machine to back up by 14 half-words 

10 of relocation instructions, and to repeat them. Note 
that since rAddr is not decremented, the repeated 
relocation instructions, will operate on the next group 
of unrelocated information items (rather than on a 
repeated group of information items) . If the initial 

15 decrementing of rpt brought that abstract machine 
variable to 0, then the RPT instruction is determined to 
be complete and the routine 540 is exited. 

The number specified in 14 is a number of half-words 
rather than a number of relocation instructions. Thus 

20 if the repetition will include one or more 4 -byte 
relocation instructions, then the value in 14 must 
include an additional count for each of such 4 -byte 
relocation instructions. Also, the value in 14 is one 
less than the number of half-words to repeat (that is, 

25 14-0 specifies a one half-word repeat, 14 = 1 
specifies a two half-word repeat, etc.) 

The LABS instruction is a 4 -byte relocation 
instruction which follows the format of Fig. 7F. 
Specifically, the high-order 4 bits contain an opcode of 

30 1010; the next two bits contain a subopcode of 00; and 
the low- order 26 bits carry a value OFFSET26. The LABS 
instruction is processed in a step 542, which adds the 
value OFFSET26 to the start address of the section whose 
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information items are currently being relocated, and 
loads the result into abstract machine variable rAddr. 
Thus LABS performs a "jump" to a specific information. 



instruction is a 4-byte instruction, the routine 542 
concludes by incrementing reloc by another two bytes. 

The LSYM instruction also follows the format of 
Fig. 7F, with a subopcode of 01. It performs the 
functions of setting a new symbol number in rSyml, and 
subsequently performing a symbol -type relocation on the 
information item then pointed to by rAddr. Both rSyml 
and rAddr are then post- incremented. The new symbol 
number is specified in the LSYM instruction in the field 
OFFSET26. The LSYM instruction is processed in step 
544 . 

The LRPT instruction follows the format of Fig. 7G, 
in which the high- order four bits carry an opcode of 
1011, the next two bits carry a subopcode of 00, the 
next four bits carry a CNT4 value and the last 22 bits 
carry a CNT22 value. LRPT is similar to RPT, but allows 
for larger repeat counts. Specifically, when the LRPT 
step 546 is invoked, the last CNT4 2 -byte relocation 
instructions are repeated the number of times specified 
in CNT22. Again, since LRPT is 4-byte instruction, step 
546 concludes, after the last repetition, by 
incrementing reloc by another two bytes. 

The instruction format of Fig, 7G is also used for 
three other 4-byte relocation instructions where the 
subopcode is 01 • and CNT4 specifies which of the three 
instructions are to be invoked. The three instructions, 
all processed by LSECs step 548 (Fig. 5), are LSECN 
(which is the same as SECN but with a very large section 
number), LCDIS (which is the same as . GDIS, but with a 



item within the current section. 



Since the LABS 
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very large section number) , and LDTIS (which is the same 
as DTIS, but ^ with a very large section number) . Again, 
since these instructions are 4 -byte instructions, LSECs 
step 548 concludes by incrementing reloc by another two 
5 bytes. 

If the opcode of the current relocation instruction 
pointed to by reloc is not recognized, then default step 
550 indicates an error and breaks. 



10 CONSTRUCTING A PEF FILE 

A PEF file in the format described above can be 
created directly from compilers. Alternatively, a 
conventional compiler can be used to create a 
conventional format executable file, and a conversion 

15 tool used to convert the conventional format executable 
file into a PEF format file. As an illustration of 
procedures which may be used to create a PEF format 
file. Appendix B is C- language program which does so 
beginning from an XCOFF- format file. The program will 

20 now be described roughly by reference to flowchart Figs. 
9-15, it being understood that a more precise 
understanding of the procedures can be found in 
Appendix B. 

Fig. 9 illustrates the overall flow of the program. 
25 In an initialization step 902, the XCOFF file is opened 
and read into memory. The various XCOFF sections are 
located in memory and their addresses stored in 
variables . 

In a step 904, the import symbols, export symbols 
30 and import file Ids are extracted from the XCOFF file 
(in-memory copy) and converted to the organization used 
in the PEF file format. Fig. lO is a flowchart 



3NSOOaD: <WO__94220r7aA^J_> 



wo 94/22078 




PCT/US94/02952 



- 51 - 

illustrating the symbol conversion step 904 in more 
detail . 

Referring to Fig. 10, all of the import file Ids are 
first extracted from the XCOFF file. Specifically, it 
5 will be recalled that the import file Ids are all set 
forth in the XCOFF file as variable length ' strings in 
the Import File Ids portion of the Loader Section of the 
XCOFF file. The ordinal of these strings represents an 
import file number, referred to in the XCOFF External 

10 Symbol Import/Export Table. The step 1002 thus involves 
copying each of the import file Id strings into the 
newly created PEF Loader String Table, and writing the 
offset of each string into a corresponding entry in the 
Import Container Ids porion of the PEF Loader Section. 

15 a step 1004, the XCOFF imports are extracted from 

the XCOFF External Symbol Import /Export Table and Loader 
String Table. This is accomplished by traversing the 
XCOFF External Symbol Import/Export Table looking for 
import symbols, writing the import symbol name into the 

20 PEF Loader String Table and an accompanying offset to 
the string into the PEF Import Symbol Table. This 
traversal is performed separately for each import file 
number, so that the resulting import symbols are grouped 
by import file number. 

25 ^ step 1006, export symbols are extracted from 

the XCOFF file. Specifically, the XCOFF External Symbol 
Import/Export Table and Loader String Table is traversed 
looking for export symbols, and for each such export 
symbol, the symbol name is written to the PEF Loader 

30 String Table and an accompanying offset is written to 
the PEF Export Symbol Table. The routine also at this 
time computes the hash value for the export symbol and 
^^i-t.es it into the corresponding entry of the Export 
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Chain Table. This traversal is performed separately and 
sequentially for different PEF section numbers, thereby 
resulting in a list of export symbols grouped by PEF 
section number . 

In a step 1008, the convert symbols routine 904 
returns to the caller . 

Returning to Fig. 9, in a step 906, the program 
sorts the PEF export symbols as set forth in more detail 
in Fig. 11. As indicated in Fig. 11, in a step 1102, 
the routine first computes the size of the Export Slot 
Table given the number of exports contained in the XCOFF 
External Symbol Import/Export Table. In a step 1104, 
the procedure loops through all of the export symbols. 
For each export symbol, the procedure computes its hash 
15 slot value given the Export Slot Table size. The symbol 
is added to a linked list for that hash slot value. The 
Export Slot Table is created in the process, with the 
chain index field of each entry temporarily pointing to 
the linked list for the slot, 

3- Step 1106, a determination is made whether more 
than a predefined number m of the slots contain more 
than a predefined number n export symbols. If so, then 
the Export Slot Table is too small. The size is doubled 
in a step 1108, and step 1104 is repeated. If the 
25 Export Slot Table is not too small, then the Export 
Chain Table and Export Symbol Table are created from the 
linked lists. Specifically, for each slot in the Export 
Slot Table, the linked list for that slot is walked to 
create these two tables. The chain index field for the 
slot in the Export Slot Table is overwritten with the 
correct index into the Export Chain Table. The 
procedure then returns to the caller (step 1112) . 



30 
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Afcer Che PEF export symbols are sorted (Fig. 9), 
the relocations in the XCOFF file are converted to PEF 
format (step 908). Fig. 12 illustrates a procedure 
which accomplishes this task. 
5 Referring to Fig. 12, in a step 1202, a 

determination is made as to whether the code and data 
sections of the PEF file have their default numbering. 
If not, then a GDIS and/or DTIS instruction is "emitted" 
in order to set the CodeA and/or the DataA abstract 
10 machine variables. The "emitting" step is described 
below. 

In a step 1204, the routine begins looping through 
all of the XCOFF Relocation Table entries. In step 
1206, it is determined whether the current entry in the 

15 Relocation Table is an import symbol relocation. If so, 
then appropriate SYMR, SYMB or LSYM instruction (s) are 
emitted (step 1208) and control returns to the looping 
step 1204. In step 1210, the routine counts the number 
of successive entries in the Relocation Table which 

20 designate the same relocation type (code or data) for 
successive longwords in the data section. If this count 
is greater than 1, then DDAT, CODE or DATA instructions 
are emitted as necessary (step 1212) and control returns 
to the looping step 1204. 

25 If the run count from step 1210 is s 1, then in step 

1214, a new run count is made of successive pairs of 
entries in the Relocation Table which designate, for 
three successive longwords in the data section, a Code 
type relocation, a Data type relocation, then no 

30 relocation. If this run count is a 1, then DESC 
instruction (s) are emitted (step 1216) and control 
returns to the looping step 1204. 
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If the run count from step 1214 is less than 1, then 
a new run count is made of successive pairs of entries 
in the Relocation Table which, for two successive 
longwords in the data section, designate a Code type' 
5 relocation followed by a Data type relocation (step 
1218) . If this 2run count is a 1, then DSC2 PEF 
instruction (s) are emitted (step 1220) and control 
returns to the looping step 1204. 

If the run count of step 1218 is less than 1, then 
10 in a step 1222, another run count is made of successive 
Relocation Table Entries designating, for two successive 
longwords in the data section, a Data type relocation 
followed by no relocation. If this run count is a 1 , 
then VTBL ins truction ( s ) are emitted as appropriate 
15 (step 1224) and control returns to the looping step 
1204 . 

If none of the above -described tests are successful, 
then the Relocation Table entry specifies a lone Data or 
Code type relocation . In a step 1226 , the routine 
20 therefore emits a DDAT, CODE or DATA instruction as 
appropriate, and returns control to the looping step 
1204 . 

When all of the Relocation Table entries have been 
traversed and converted to PEF relocation instructions, 

25 the routine 908 returns to the caller (step 1228) . 

Fig. 13 illustrates the routine 1208 to do the SYMR, 
SYMB and LSYM instructions. In a step 1302, a run count 
is made of successive import symbol relocations in the 
Relocation Table for successive PEF import symbol 

30 numbers. Note that in order to use the PEF SYMR/SYMB/ 
LSYM relocation instructions, these import symbol 
relocations must be to successive import symbols as they 
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are numbered after the conversion to the PEF format 
(step 904) , not before. 

In a step 1304, the delta between the first longword 
to be relocated for this run, and the last longword to' 
5 be relocated for the previous PEF instruction, is 
determined. If this delta is greater than 0, then a 
DELTA instruction, or if necessary an LABS instruction, 
is emitted (step 1306) . Then, if the run count is 
greater than l (step 1308) , one or more SYMR 

10 instructions are emitted (step 1310) and the routine 
returns to the caller (step 1312) . More than one SYMR 
instruction can be emitted if the run count is larger 
than the largest run count which can be represented in 
the run count field of the SYMR instruction. 

15 If the run count was equal to 1 (step 1308) , then in 

order of preference, either an SYMR, SYMB or LSYM < 
instruction is emitted (step 1314) and the routine 
returns to the caller (step 1312) . 

Fig. 14 sets forth a procedure to do the DDAT, CODE 

20 and DATA instructions 1212 (Fig. 12) . The procedure is 
reused in step 1226 (Fig. 12). Referring to Fig. 14, a 
determination is first made (step 1402) as to whether 
the run of entries is a run of Data type relocations. 
If so, then it is determined whether a DDAT instruction, 

25 possibly preceded by DELTA instruction, can successfully 
represent the run (step 1404) . The maximum run count 
which can be represented in a DDAT instruction is 
considered here, as are data alignment requirements. If 
such instruction (s) will work, then they are emitted 

30 (step 1406) and the procedure returns (step 1408) . 

If the DELTA/DDAT instructions will not accurately 
represent the run (step 1404) , or if the run is a run of 
Code type relocation (step 1402), then in a step 1410, 
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CODE or DATA instruction ( s ) are emitted as required. 
The step 1410 may be preceded by the emission of a DELTA 
or LABS instruction as required (step 1412) . The 
routine then returns to the caller (step 1408) . 

The step 1216 (Fig. 12) of doing DESC instructions 
is accomplished with a procedure set forth in Fig. 15. 
Referring to Fig. 15, if a DELTA or LABS instruction is 
required prior to the DESC instruction, then it is 
emitted in a step 1502. The DESC instruction ( s ) are 
then emitted in step 1504, and the routine returns to 
the caller in step 1506. The routines in Fig. 12 to do 
DSC2 instructions (step 1220) and VTBL instructions 
(step 1224) are identical to that set forth in Fig. 15, 
except that in step 1504 (Fig. 15), DSC2 instructions or 
15 VTBL instructions are emitted instead of DESC 
instructions . 

In the above description of the routine to convert 
the relocations from XCOFF entries to PEF relocation 
instructions (Fig. 12), whenever it has been stated that 
20 a PEF relocation instruction is "emitted", the emitted 
relocation instructions could be placed in a temporary 
buffer. At a subsequent time , the temporary buffer 
could be scanned for repeating patterns and RPT or LRPT 
instructions generated therefor. 

Another technique for generating the RPT and LRPT 
instructions is to identify patterns as they are being 
created. In this techniques, PEF instructions which are 
"emitted" by the relocation conversion routine 908 are 
held in a first-in, first-out (FIFO) buffer until the 
buffer is full or a pattern is detected. If the buffer 
is full, then the first PEF instruction in the buffer is 
outputted, and the newly emitted instruction is added to 
the end; the routine then returns. If a newly emitted 
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instruction starts or continues a partial match of 
instructions in the buffer, then a match size variable 



completes a match, then the match size variable returns 
to zero and a repeat count variable increments. If the 
newly emitted instruction breaks a match, then the 
common instructions of the pattern are first outputted 
and deleted from the buffer, and then an appropriate RPT 
or LRPT instruction is outputted. The newly emitted 
instruction is then added to the buffer. Note that 
instructions outputted by this pattern matching process 
are still not output to the final PEF file; they are 
merely stored sequentially in a final in-memory version 
of a PEF Relocations buffer. 

Returning to Fig. 9, after all of the entries in the 
XCOFF Relocation Table have been converted to PEF 
relocation instructions (step 908) , the data sections 
themselves are packed using the pattern- initialized data 
(pidata) format described above (step 910) . In step 
912, now that all of the PEF section sizes have been 
established, offsets are computed and all of the headers 
are filled with the necessary information pointing to 
the different sections (step 912) . In step 914, each 
block of the PEF file is written out, with padding bytes 
inserted as necessary to meet data alignment 
requirements . 

The invention has been described with respect to 
particular embodiments thereof, and it will be 
understood that numerous modifications are possible 
without departing from its scope. For example, the 
invention may be applied to relocatable object code file 
formats produced by a compiler prior to linking, 
although the invention is less useful in that context 



merely increments . 



If 



the newly emitted instruction 
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due to Che relative scarcity of repeated patterns of 
relocations to be performed. 
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APPENDIX A (37 C.F.R. 5 1 . 9 6 ( a ) ( 2 ) ( i ) 
CopvricTht: 19 9 3 Apple Computer. Tnr 
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OSErr PLPrepareRegion ( TPLPrivatelnfoPlr ourPLInfo, 
TCount reglndex. 
TAddress regAddress ) 

register TAddress *raddr; 

register TOffset dataA; 

register int cnt; 

register TOffset codeA; 

LoaderRelExpHeaderPtr IdRelHdrPtr; 

Relocation *reloc» *rlend; 

Relocation r; 

long rpt; 

long seen; 

long rsymi; 

TOffset ^imports; 

TOffset ^regions; 

long i; 

long relNum; 

TOffset regStart; 

int err = noErr; 
#if NO GLOBALS 
^include "PEFtbl.h" 
^endif 



if (ourPUnfo == NULL) return paramErr; 

if((reglndex < 0) | | (reglndex >= ourPUnfo- > numRegions)) return paramErr; 
if (ourPUnfo- >sections[reglndex].regionKind == kLoaderSection) return paramErr; 
if (ourPUnfo- >sections[ reglndex]. regionKind = = kPIDataSection) return unimpErr! 

ourPUnfo -> resolved = 1; 

for (i = 0; ; i+-|-) { 

if (i > = ourPUnfo -> numRegions) return noErr; /*no relocations for this data section*/ 

IdRelHdrPtr = &. ourPUnfo -> IdSections [i]; 

if (IdRelHdrPtr -> sectionNumber == reglndex) break* 

} 

regions = ourPUnfo -> regionDeltas; 
imports = (TOffset *) ourPUnfo -> imports; 

regStart = (TOffset) regAddress; 

// subtract old default address from region address 

// region array will hold deltas from new address to old address 
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// for (i = 0; i < ourPUnfo -> numRegions; 

// regions [i] -= ourPLlnfo -> sections [i]. sectionAddress; 

// the regions array already holds deltas - no need to change it 

reloc = fReiocation*) (ourPLlnfo -> IdRelocations + IdRelHdrPtr - > relocationsOffset); 
5 rlend = CRelocation ***) (CReioclnstr '**) reloc + IdRelHdrPtr - > numRelocations); 

raddr = (TAddress *) regStart; 
rsymi — 0; 
code A = regions [0]; 
dataA = regions [I]; 
10 rpt = 0; 

relNum — 0; 
while (reloc < rlend) { 
r = *reloc; 

15 reloc = (Relocation (CReioclnstr reloc -f I); 

switch ( opcode [r. opcode. op] ) { 
case krDDAT : raddr = (TAddress *) 

(CTAddress) raddr -f r.deltadata.delta_d4 * 4); 
cnt = r.deltadata.cnt; 

2 0 while (~cnt > = 0) { 

// printf (''(0O04:%061x) (#%4d) l\n", (char raddr - regStart, relNum++); 

*raddr+ + + = dataA; 

} 

continue; 

25 

case krCODE : cnt = r.run.cnt_ml + 1; 

while (—cnt > = 0) { 
// printf (''(0004:%061x) (#%4d) 0\n\ (char raddr - regStart. relNum+ +); 

^raddr^ 4- H- = codeA; 

30 } 

continue; 

case krDATA : cnt = r.run.cnt_ml + 1; 

while (—cnt > = 0) { 

3 5 // printf ("(CXX)4:%06Ix) (#%4d) l\n", (char *) raddr - regStart. reINum++); 

*raddr-l-+ += dataA; 

} 

continue; 

4 0 case krDESC : cnt = r.run.cnt_ml -f 1; 

while (— cnt > = 0) { 
// printf ('•((XX)4:%061x) (^%4d) 0\x\\ (char *) raddr - regStart. relNum+ +); 

"♦^raddrH- +-!-=: codeA; 
// printf (''((XX)4:%061x) (#%4d) \\n\ (char *) raddr - regStart. relNum++); 
4 5 *raddr+-h -h = dataA; 

raddr+ + : 

} 

continue; 
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case krDSC2 : cnt = r.run.cnt_ml + 1; 

while (-cnt > = 0) { 
// printf ("(0004:%Q6Ix) (#%4d) 0\n\ (char raddr - regStart. reINum++); 

*raddr4- + = code A; 
5 // printf (70004: %06Ix) (#%4d) \\n\ (char raddr - reeStart. relNum+-f); 

*raddr++ -f = dataA; 

} 

continue; 

10 case krVTBL : cnt = r.run.cnt_ml + I; 

while (-cnt > = 0) { 
// printf ("(0004:%06Ix) (#%4d) l\n\ (char raddr - regStart, relNum+-f ); 

'^'raddr-h + 4- = dataA; 
raddr+ + ; 

15 } 

continue; 

case krSYMR : cnt = r.run.cnl_ml -h 1; 

while (--cnt > = 0) { 

20 // printf (*'(0004: %061x) (#%4d) %d\n'\ (char *) raddr - regStart, reINum+ + , rsymi); 

*raddr-f- + + = imports [rsymi + 4-]; 

} 

continue; 

2 5 case krSYMB : rsymi = r.glp.idx; 

// printf (''(0004:%06Ix) (#%4d) %d\n\ (char raddr - regStart, reINum+ rsymi); 
*raddr+ -h + = imports [rsymi + +]; 
continue; 

3 0 case krCDIS : codeA = regions [r.glp.idx]; 

continue; 



35 



case krDTIS : dataA = regions [r.glp.idx]; 
continue; 

case kj-SECN : •**raddr-l-+ += regions [r.glp.idx]; 
continue; 



case krDELT : raddr = (TAddress 

((TAddress) raddr + r. delta. delta_ml + 1); 
continue; 

case krRPT : if (-rpt ==0) continue; // count was 1 ~> rpt done 
if (rpt < 0) // first time rpt encountered? 

rpt = r.rpt.rcnt_ml -h 1; // yes- initialize rpt count 
cnt = r.rpt,icnt_ml 4- 2; // yes or no - back up cnt instrs 
reloc = (Relocation "*•) ((Reloclnstr reloc - cnt); 
continue; 
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case krLABS : raddr = (TAddress *) 

((r.largel.idx_top < < 16) + reloc -> bot + regStart); 
reloc = HRelocation '*') ((Reloclnstr *) reloc -f- 1); 
continue; 



10 



case krLSYM : rsymi = (r.Iargel .idx_top < < 16) -h reloc -> bot; 

reloc = (Relocation '*') (CRelocInstr *) reloc + 1); 
// printf ("(0004:%06lx) (#%4d) %d\n". (char raddr - regStart. relNum+ + , rsymi); 
'**raddr+ -h + = imports [rsymi + +]; 
continue; 



15 



20 



case krLRPT : if (-rpt = = 0) { 

reloc = (Relocation *) ((Reloclnstr reloc + 1); 
continue; 

} 

if (rpt < 0) 

rpt = (r.Iarge2,idx_top < < 16) + reloc -> bot; 
cnt = r. large!. cnt_ml + 2; 

reloc = (Relocation *) ((Reloclnstr '*') reloc - cnt); 
continue; 



case krLSEC 
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} 



default 
} 



seen = (r.large2.idx_top < < 16) -f reloc -> bot; 
switch (r.large2.cnt_ml) { 

case 0 : •*'raddr+-f -f = regions [seen]; break; 

case 1 : code A — regions [seen]; break; 

case 2 : dataA = regions [seen]; break; 

} 

reloc = (Relocation *) ((Reloclnstr *) reloc + 1); 
continue; 

err = unimpErr; 
break; 



// restore actual region addresses to the regions array 
// instead of region deltas 

// for (i = 0; i < ourPUnfo -> numRegions; i+-f-) 

// regions [i] += ourPLInfo -> sections [i]. sectionAddress; 

// the region array now always holds deltas - don't want to change it back 

return err; 
} /* PLPrepareRegion () */ 
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APPENDIX B (37 C.F.R. j^l. 96(a) (2) (i) 
Copyright: 1992 > 1993 Ap p le Computer. Inc. 
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*/ 



File: 
Contains: 
Written by: 
Copyri ght : 



^include <Errors.h> 

# include <Memory.h> 

^include <Fi les.h> 

^include <strTng.h> 

#if MAC 

#include <stdio,h> 
#endif 

/* #include <StdLib.h> 
#if MAC 

#include <CursorCt I . h> 
#end if 



// for some constants like kDefVersion: 
#include <CFLoader.h> 

#include "XCOFF.h" 
# include "PEF.h" 

typedef unsigned char byte; 

#define TRUE 1 
#define FALSE 0 



MakePEF.c 

XCOFF to PEF converter tool 
Erik L Eidt 

- 1992-1993 by Apple Computer, Inc., all rights reserved. 



/* jma */ 
/* jma ♦/ 
/* jma */ 



byte *Malloc ( long len ); 
long Write ( int fd, byte *buf. 



long len ); 



byte *PartTalName ( byte *name ); 

OSErr MakePStr ( byte *src, int len, byte *trg, int max ); 

void movebytes ( byte *s, byte *t, long Length ); 

void clearbytes ( byte *t, long length ); 

int comparebytes ( byte *s, byte *t, int length ); 

int OpenRead ( byte *fName, byte **buf, long *len ); 

long Get Int ( byte *ptr, byte **out, byte *end ); 

int mchlen ( byte *str, byte *mch ); 

int Log2 < unsigned long a In ); 

int CheckLibOpt ( byte **namep, int *nameLengthp, TVersion *linkedVer, TVersion *olditrcVer 

TBoolean *iB ); ^ ' 

int CheckXCOFF ( byte *theContainer ); 

int FindStuff ( byte *theContainer ); 

int ComputeOf f sets ( void ); 

int FillHeaders ( byte *inFileName ); 

long MapSecMunToPef f ( long secNm, long *defAddr ); 

long RelocAddr ( TLdRelPtr reloc ); 

long Reloclndex ( TLdRelPtr reloc ); 

void dodetta ( long relpos, long curpos ); 

int RelCvt ( TLdRelPtr reloc, long count ); 

int DataAdjust ( TLdRelPtr reloc, long count ); 
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long str2long ( byte **str ); 



Boolean TryPack i ngData ( void ); 

OSErr PackData ( register byte *ptr, register byte *end, byte *target, byte **tend ); 
5 OSErr UnpackOata ( register byte *ptr, register byte *end, register byte *trg, byte *tend 
); 

void PrtReloc ( Reloclnstr w1 ); 

void CheckMatch ( Reloclnstr wl , int matchStart ); 
10 void RestartMatch ( Reloclnstr w1 ); 

void AddReloc ( Reloclnstr wl, int matchStart ); 
long MacTime ( long unixTime ); 
long GetFileTime ( byte ♦fileName ); 

15 /* the following were not declared, hence generated warnings... jma */ 

int ProcessExportlist ( void ); 

void ComputeSections C void ); 

int SyrnCvt ( TLdSymPtr IdSyms, long syrnCnt ); 

int SymSort ( void ); 
2 0 int WriteFile ( byte *outF i leMame ); 

void Free ( byte *tofree ); 

long FindXCOFFIndexByName ( byte *name, TLdSymPtr IdSyms, long symCnt ); 
/* ******** * * *** ***** */ 



/» ******* ************************** *********** *. */ 

void convertString2Long (byte* theString, unsigned long* theLong); 
/* *************************** *... / 



#define badUnitErr -AS 



// The default version value is now private to this tool. 
// This is a legitimate version, not the wildcard value. 
35 #define kDefVersion 0 

long ALIGNMENT = 16; 
long oldDefVersion = kDefVersion; 
long otdlmpVersion = kDefVersion; 
40 long currentVersion = kDefVersion; 

byte dataSection = kDataSect i on; 
byte shareOption = kContextShare; 

#define CODE INDEX 0 
45 #define DATAINDEX 1 

#define BSSINDEX 2 
#define kSpecialSymbolCount 3 

int secNun C33 ; 
5 0 long locNLfn [33; 



// XCOFF stuff 

55 TFileHdrPtr fileHdrPtr; 

TAuxHdrPtr auxHdrPtr; 

TScnHdrPtr scnHdrs; 

TScnHdrPtr IdrScnHdr; 

TScnHdrPtr codeScnHdr; 

60 TScnHdrPtr dataScnHdr; 

TScnHdrPtr bssScnHdr; 

TLdHdrPtr IdHdrPtr; 

T LdSymP t r I dSyms ; 

6 5 long symCnt; 
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IdRel ocs; 


long 


relCnt; 


byte 


*ldStr i ngs ; 


long 


codeVAddr; 


long 


codeLen; 


by t e 


*codeStart ; 


long 


dataVAddr; 


long 


dataLen; 


byte 


*dataStart; 


long 


packedDataLen; 


byte 


*packedDataStart 


byte 


*bssStart; 


long 


bssVAddr; 


long 


bssLen; 


long 


dbsLen; 



TLdSymPtr remSym [3] ; 

// PEFF stuff 

Reloclnstr *relocat ions; 
long relWordCnt = 0; 



byte *stringTable; 
3 0 long stringTabSi ze - 0; 

long stringTabOf f ; 

^define kNumSpclSect ions 3 
Sdefine kMaxLoadableSect i ons 2 
3 5 ^define kMaxPeff Sect ions 3 

int NunPef f Sect ions; 
int NumLoadableSect i ons; 

int nextSectWLin; 

40 int codeSectNLm = -1; 

int dataSectNun = -1; 

int loaderSectNum = -1; 

FileHeader fileHdr; 
45 Sect ionHeader sections [kMaxPeff Sect i ons] ; 

byte strings [20]; 

LoaderHeader IdHdr; 

LoaderRelExpHeader loaderHdrsAry [kNumSpclSect i ons + kMaxLoadableSect i ons3 ; // 
50 -3,-2,-1,0,1,2 

LoaderRelExpHeaderPtr loaderHdr = & loaderHdrsAry [kNunSpc ISect ionsj ; 

LoaderlmportFi lelOPtr importFiles; 

Loader ImportPtr import Symbols; 

long import Next; 
55 HashSlotEntryPtr exportHashSlot ; 

HashChainEntryPtr sortedHashChain, exportHashChain; 

LoaderExportPtr sortedExportSyrrbots, exportSymbols; 

long export Next; 

long sortedExportNext; 

6 0 

long "XCOFFSyinlndexToPeff Import NLinber; 
long lastFi leAddr; 
65 Boolean gPackecData; 
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Boolean optNuUPad; 
Boolean optBSSExpand; 
Boolean optKeepHIT; 
Boolean optGlobal Str i ngTab; 
5 Boolean optNoPackDara; 

Boolean debug; 

byte *expF i leName; 
10 byte *outFi leName = (byte *) "a.pef"; 

^define kOefCreator '???\?' 
#define kDefType 'APPL' 
unsigned long gFileCreator = 0; 
15 unsigned long gFileType = 0; 

Flnfo gFinderlnfo; 

Boolean entryUsed, ini tUsed, terfrtJsed; 

2 0 byte *entryPoint; 

long entryPointNameLength; 
byte *ini tPoinc; 
long ini tPointNameLength^- 
byte *termPoint; 

25 long rermPointNameLength; 



byte *libInfo ClOO); 
3 0 int I ibOpts; 

long pef fCodeVAddr; 
long pef f DataVAddr; 

3 5 typedef struct i 

long format; 
byte *otdname; 
byte *name; 
long seen; 
40 long offset; 

int class; 
} AddExpType; 
#define kAbsolute 0 
#define kEquivalence 1 



45 



AddExpType *addExpAry; 
long addExpCnt; 



typedef struct Weaklmport C 
50 struct Weaklmport *next; 

byte *UbName; 

byte *name; 
> Weaklmport; 

55 Weaklmport *WeakImpor tL i st ; 

int AddWeaklmport ( byte *libName, byte *name ); 
int CheckWeak ( void ); 

void ChecklmportsForUndefOK ( int filelndex ); 
60 int f i leCnt; 

byte *gToolName; 

#ifdef JMA /* for simple AIX testing 0*N*L*Y 

65 #include "hash.c" 
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# include "myMacL i b. c" 
#endf f 

^ main (int argc, byte **argv) 

long len; 
byte *buf; 
int err; 

long sectionlndex; 
10 byte *libName; 

byte *ptr; 

byte *inFi leName; 

int t ; 

15 gToolName = PartialName < argv CO] ); 

for (T = 1; 1 < argc; i+*) { 

if (argv [i) CO) »= '-') C fileCnt-^*; inFileName = argv Ci]; continue- > 
switch (argv [i] CU) C 
20 case 'a' : if (i + l >= argc) break; 

ALIGNMENT = Getint ( argv C++i], 0, 0 ); 
cont inue; 
case 'b' : optBSSExpand = TRUE; 

cont inue; 
case 'c' : if (i+1 >= argc) break; 

currentVersion = Getint ( argv C**i3, 0, 0 ); 
cont inue; 
case 'd' : if (i+1 >= argc) break; 

oldDefVersion = Getint ( argv C++i], 0, 0 ); 
cont inue; 
case 'e' : if (i+1 >= argc) break; 

entryPoint = argv C++i]; 

entryPointNamelength = strlen ( (char*) entryPoint ); 
cont inue; 
case 'f : if (i + l >= argc) break; 

if ((argv Ci] C2] ) == 'c') /* file creator */ 

convertStringZLong C(byte*)argv [++i3, & gF i I eCreator) ; /• yuck' is there 
better way to do this? */ 

else if ((argv Ci] C23 ) == 't') /* file type */ 

convertString2Long ((byte*)ar9v C+'^i), & gFileType); /* yuck' is there a 
better way to do this? */ 
else 

expFi leName = argv C++i]; 

continue; 

case 'g' : optGlobalStringTab = TRUE; 

continue; 
case M' : if (i + 1 >s argc) break; 

initPoint ~ argv C++i); 

ini tPointNameLength = strlen ( (char*) initPoint ); 
continue; 

case M' : printf C"# %s: Bad switch -I (upper-case- i ) Did you mean 

lower-case-L instead?\n", gToolName); 
return 1; 
case 'k' : optKeepMIT = TRUE; 
continue; 

case 'L' : 

case M' : if (i + l >= argc) break; 

tiblnfo CtibOpts++] = argv C++i); 
// format: -I I i bName C=renamel C#vU- v2] J C! } 
cont i nue; 
case 'n' : optNullPad = TRUE; 

continue; 
case 'o' : if (i+1 >= argc) break; 

outFi leName = argv C+-^i3; 
cont inue; 
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se 'r' : dataSection = kConstantSection; 

cone i nue; 
se 's' : if >= argc) break; 

shareOption = Getint ( argv 0, 0 ); 

cont i nue; 
se 't' : if (i+1 >= argc) break; 

termPoint = argv C-^+il; 

termPointNameLength = strlen ( (char*) termPoint ); 

cont i nue; 
se 'u' : if <i*l >= argc) break; 

oldlmpVersi on = Getint ( argv C+>i}, 0, 0 ); 

cont i nue; 
se 'v' : if (i+1 >= argc) break; 

sectionlndex = Getint ( argv & ptr, 0 ); 

if ( *ptr++ != ) break; 

switch (sectionlndex) { 

case 0 : pef f CodeVAddr = Getint ( ptr, 0, 0 ); continue; 
case 1 : pef fDataVAddr = Getint ( ptr, 0, 0 ); continue; 
> 

break; 

se 'w' : if <>+! >= argc) break; 
libName = argv C++i}; 
ptr = libName; 

while ( *ptr && *ptr != ) ptr++; 

if (*ptr >= ' : ' ) break; 
*ptr++ = 0; 

AddUeak Import ( libName, ptr ); 

continue; 
se 'x' : optNoPackOata = TRUE; 

continue; 
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help: print 
print 

hex 100>\n" 
print 
print 
print 
print 
print 
print 
print 
print 
pr i nt 
pri nt 
print 

(def =remQve 
print 
print 
print 
print 
print 
print 
print 
print 

(def =nonut I 
print 
print 
print 

(def =contex 
print 
print 
print 
print 

to be undef 



'# /Is: usage: %s C options ) [ -o out file ] infiLe\n", gToolName, gToolName); 
'# (numbers arguments are read in decimal unless preceed by Ox ie:0x100 = 

'# options:\n"); 



-f 



-k 



# 

# 
); 

# -I 

# 
# 

'# -n 

); 

# -o 

# -r 

# -s 
are)\n"); 
'# -t 

-u 

-V 

'# -w 
runtime)\n"); 



# align code & data secitons to # byte boundary (def=8)\n*') ; 
expand bss into zeroed data (def =no)\n" ) ; 

# set current-version to # (def=0)\n"); 

# set definition version to # (def =0)\n'*) ; 

name set entry point to routine. name (def =what-XCOFF - says)\n" ) ; 
fname use file fname to load additional export list\n"); 
fc name set the creator of the output file\n"); 
ft type set the file type of the output file\n"); 

i name set initialization routine to procedure name (def =none)\n*' ) ; 
keep init,term & main routines as export symbols 

otdname [=newname] [#vers [-upf v] ] [!} [-]\n"); 

change imported library with name oldname : \n") ; 
=:newname to newname\n"); 

#vers to current- version vers requi red)\n") ; 
-upfv to up-from- vers ion upfv\n'0; 
! to i nit before\n"); 

to weak importXn"); 
pad exported loader symbols with trailing null 

fname set the target output file name (def =a,pef )\n"); 

mark data section as read-only (def =read- wr i te)\n" ) ; 

# set share option of data section to # 

set termination routine to procedure name (def =none)\n" ) ; 
set implementation version to # (def=0)\n"); 
make section #1 have default virtual address #2 (def=0)\n"); 
I ibName: ImpName make import impName from library libName weak (ok 



name 
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printf (•*# -X disable automatic data section pack ing\n'») ; 
printf C"# %s version 2.13 Cb6)\n», gToolMame); 
return 1; 

5 

if (expFi leName && fileCnt == 0) ( ) 
else if (fileCnt != 1) goto help; 

U\f MAC 

10 SpinCursor ( 32 ); 

#endif 

if (expFi leName) < 

err = ProcessExportL i st (); 
15 if (err) return 1; 

> 

#if MAC 

SpinCursor ( 32 ); 
20 #endtf 

if (fileCnt) ( 

err = OpenRead ( inFi leName, & buf, & len ); 
if (err) return 1; 

err = CheckXCOFF ( buf ); 

if (err) ( printf { '»# Xs: '%s' not XCOFF executable file format\n"- gToolHame argv 
C13 ); goto EBOT; > 



25 



3 0 err = FindStuff ( buf ); 

if (err) goto EBOT; 

> 

*^if MAC 

3 5 SpinCursor ( 32 ); 

Jl^endif 

ComputeSect ions (); 

4 0 #if MAC 

SpinCursor ( 32 ); 
#endif 

err = SymCvt ( IdSyms, symCnt ); 
45 if (err) goto EBOT; 

#if MAC 

SpinCursor ( 32 ); 
#endif 



50 



err = SymSort ( ) ; 
if (err) goto EBOT; 



#if MAC 

5 5 SpinCursor ( 32 ); 

#end i f 

if (fileCnt) < 

err = RelCvt ( IdRetocs. relCnt ); 
oO if (err) goto EBOT; 

> 

#if MAC 

SpinCursor ( 32 ); 
5 5 #endif 
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if ( dataSectNum >= 0 » optBSSExpand && ! optNoPackData ) 
gPackedData = T ryPack i ngOata C); 

#if MAC 

5 SpinCursor C 32 ); 

#endTf 

ComputeOf f sets C); 

10 FiUHeaders ( inFileHame ); 

#if MAC 

SpinCursor ( 32 ); 
#endi f 



15 



45 



err = WriteFile ( outFileName ); 
if (err) goto EBOT; 



#if MAC 

2 0 SpinCursor ( 32 ); 

#endi f 

return 0; 

2 5 E80T : 



prtntf ("# 51s: 'V.s' not converted\n", gToolName, i nF i leName) ; 
return 1; 
> /* main C) */ 

30 

V 

/* The Mac OS does not allow you to seek past EOF. I put in this routine to figure */ 
/* out if we're trying to do so, and write an appropriate ntmber of bytes of zero */ 

3 5 /* instead. */ 

/* 

/* *********************************** *^ 

void my_SetFPos (short refNum, short mode, long offset) 
C 

4 0 #ifdef MAC 

#pragma unusedCnrtode) 
byte zero = 0; 
I ong i ; 

long curLength; 



/ 



GetFPos (refNLm, &curLength); /* get current size of file */ 



if (curLength < offset) 
i 

50 for (i = curLength; i < offset; i++) 

Write ( refNum, &zero, 1); 

> 

return; 

55 } 

#else 

SetFPos (refNtm, mode, offset); 
#endi f 
) 

60 

Boolean TryPackingOata ( void ) 
C 

byte *block; 
6 5 long length; 
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byte *target, *end; 
OSErr err; 

block = dataStart; 
5 length = dataLen;' 

target = (byte *) NewPtr ( length + 10 ); 
if ( ! target ) C 

10 return'pA^SE-^'^' ^""^'^ allocate target buffer (%d bytes)\n». gToolName. length ); 

end = target + length; 

err = PackData ( block, block ♦ length, target & end )• 
15 if < err ) < ' . 

printf C"# %s: Packed Data not smaller, not pack i ng. . . \n" gToolName)- 
return FALSE; 

> 

else i 

2 0 byte *compBuf; 

register int i; 

// printf ( "Unpacked Size = %d\n", length ); 

// printf ( "Packed Size -== y^\n", gPackedSize ); 

// printf ( "Actual Packed Size = %d\n", end - target ); 

corr^Buf = (byte •) NewPtr ( length ); 
if ( r compfiuf ) { 

. ^P''^"^^ C"* '^s: Could not allocate comparison buffer (%d bytes)\n", gToolName 
-J u length); 

return FALSE; 

> 

^ r- ~ UnpackData ( target, end, compBuf, conpBuf + length )• 

JO if ( err > < 

printf ("# %s: Error unpacking data for verification; not packing data\n" 
gToolName); ' 
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return FALSE; 
> 



for { i = 0; i < length; i++ ) < 
if ( compBuf [i] != block Ci] > C 

printf ("# %s: Verify error at byte offset %x; not packing data\n", gToolName i)- 
return FALSE; » 

45 > 

> 

> 



packedDataStart = target, 
packedDataLen = end - target; 
dataSection = kPIDataSecti on; 

return TRUE; 
> /* TryPackingData () */ 



void ComputeSections ( void ) 

nextSectNum = 0; 

lastFiteAddr = sizeof fileHdr; 

if ( codeLen ) C 

codeSectNun = next Sec tNurm-+ ; 
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lastFileAddr += sizeof C Sect i onHeader ) ; 



dbsLen = dataLen + bssLen; 
5 if ( dbsLen ) ( 

dataSectNum = nextSectNLm^*; 
lastFileAddr += sizeof ( Sect i onHeader ) ; 

> 

10 loaderSectNum = nextSectNufT>*-+; 

lastFileAddr sizeof (Sect ionHeader) ; 



if ( optGlobalStringTab ) 

lastFileAddr += sizeof strings; 

> /* ComputeSecrions () */ 



long Align ( long offset, long aln ) 
20 ( 

int left = offset & Caln-1); 
if (! left) return offset; 
return offset + aln - left; 
) /* Align () */ 

int ComputeOf f sets ( void ) 
long i; 

3 0 long loaderSectionSize; 

if (dataSectNLTTi >= 0) i 

loaderHdr [dataSectNLin] . nLrrflel oca t ions = relUordCnt; 

__ loaderHdr [dataSectWun] . relocat i onsOf f set = 0- 

35 > 



for (i = kReExport Import ; i < kMaxLoadableSections; i++) 

if (loaderHdr [i], numRe L oca t ions) // loaderHdr [i). nwnExports) 

IdHdr. nLnSections++; 



IdHdr. relocat i onsOff set = 

sizeof (LoaderHeader) + 

IdHdr. numlmportFi les * sizeof (Loader IrrportF i LeID ) + 

^ c- IdHdr. nLmlmportSyms * sizeof (Loaderlrrport) + 

IdHdr. numSections * sizeof < LoaderRelExpHeader ) ; 



IdHdr. str ingsOf f set = 

IdHdr. relocat ionsOff set + 
relWordCnt * sizeof (Reloclnstr); 



IdHdr. hashSlotTable = Align ( IdHdr. st ringsOf f set + stringTabS i ze, A ); 

loaderSectionSize = IdHdr. hashSlotTable ^ 

IdHdr. hashSlotTabSize) * sizeof ( HashS lotEnt ry) + 
IdHdr. nunExportSyms * sizeof (HashChainEntry) + 
IdHdr. niJifxportSyms * SIZEOF_LoaderExport ; 

// NOTE : 

^- // The section nLinbers of the sections is set by the macro's 

O U LoaderSectNum, codeSectNumSdataSectMum 

// which are used to number the sections and set their section header order. 

// Rearange as desired by changing macros loaderSectNum, codeSectNum & dataSectNum 

// Be sure to keep the the numbers (and therefore headers) of the loadable 
^cr sections to be before the nimbers (and therefore headers) of the 

// non- loadable sections. 
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// NOTE: the order of the 3 data of the sections below determines 
// t^e fUe order of the sections 

^•^^ code that actually writes to the file peforms i ndependant I y 
// from the order of sections in the file 



10 
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// Loader Section 

sections [loaderSectNum] . conta i nerOf f set ^ Align ( lastF i leAddr, 
sections C loaderSectNum] . rawSize = loaderSectionSize; 
sections CloaderSectNum] . alignment = Log2 ( A ); 
lastFileAddr = sections CloaderSectNumJ . conta i nerOff set + 
sections [loaderSectNum], rawSize: 



8 ) 



if (codeSectNum >= 0) C 
// Code Section 
sections CcodeSectNum] 
sections CcodeSectNum] 
sections CcodeSectNum] 
sections CcodeSectNum] 
sections CcodeSectNum] 
lastFileAddr = sections CcodeSectNum] 
sections CcodeSectNum] 



containerOffset = Align ( lastFileAddr, ALIGNMENT ); 
rawSize = codeLen; 
initSize = codeLen; 
execSize = codeLen; 
alignment = Log2 C ALIGNMENT ); 

containerOffset + 
rawSi ze; 



30 



35 



40 



if (dataSectNum >= 0) ( 
// Data Section 

sections CdataSectNun] . containerOffset = Align C lastFileAddr, (gPackedData ? 4 
ALIGNMENT) ); 

sections CdataSectNum] . rawSize = gPackeci)ata ? packedDataLen : 

(optBSSExpand ? dataLen + bssLen : dataLen)* 
sections CdataSectNum]. initSize = (optBSSExpand ? dataLen + bssLen : dataLen)- 
sections CdataSectNum]. execSize = dataLen + bssLen; 
sections CdataSectNum]. alignment = Log2 ( ALIGNMENT ); 
lastFileAddr = sections CdataSectNum], containerOf f set' + 
sections CdataSectNun] . rawSize: 

> 

return 0; 
> /* ComputeOf fsets () */ 
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int FillHeaders ( byte *inFileName ) 

static byte sectionNames [] = "code\Odata\Oloader"; 
movebytes ( sectionNames, strings, sizeof sectionNames ); 



fileHdr. magici 

fileHdr. magic2 

fileHdr. filelypelD 

fileHdr. archi tecturelD 

fileHdr. vers i onNtmber 

#if MAC 

fileHdr. datet imeStamp 

GetFileTime ( inFileName ); 
#else 

fileHdr. dateT imeStamp 
#endif 

fileHdr. oldDef Version 

f i leHdr . oldlmpVersion 



= peff Magici; 
= peffMagic2; 
= peffTypelD; 
= PowerPC ID; 
= kPEFVersion; 



= fileHdrPtr -> fhTirrOat ? fiteHdrPtr -> fhTirrOat : 
= MacTime ( fileHdrPtr -> fhTirrOat ); 



oldDefVersion; 
oldlmpVersion; 



// option 
// option 
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fUeHdr. currentVersion = currentVersion; // -- option 

fileHdr. numberSect ions = nextSectNum; 

fileHdr. loadableSect i ons = nextSectNum - 1; // loader section not "loadable" 

5 sections [ t oaderSec tNum) . sectionName = optGlobalStringTab ? 10 : -1; 

sections C I oaderSec t Muni . sect i onAddress = 0; 

sections C I oaderSec tNum] . regionKind = kLoaderSect i on; 

sections C loaderSectNum] . shareKind = kGlobalShare; 

10 if (codeSectMum >= 0) ( 

sections CcodeSectMum) , sectionName = optGlobalStringTab ? 0 : -1; 
sections CcodeSectNLffn] . sect i onAddress = pef f CodeVAddr ; // option 

sections tcodeSectNum] . region<ind = kCodeSect i on; 
sections CcodeSectMum] - shareKind = kGlobalShare; 

15 > 

if (dataSectNum >= 0) C 

sections CdataSectNun] . sectionName = optGlobalStringTab ? 5 : -1; 

sections [dataSectNum]. sect i onAddress = pef fDataVAddr; //'-- option 

20 sections [dataSectNum]. regionKind = dataSection; // option 

sections [dataSectNun] . shareKind = shareOption; // option 

loaderHdr [kReExport Import] . sect ionNumber = kReExport Inport ; 
25 loaderHdr [kPhysicalExport] . sect i onNijnber = kPhysicalExport; 

loaderHdr [kAbsoluteExport] . sect i onNLmrtier = kAbsotuteExport; 
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if CcodeSectMum >= 0) 

loaderHdr [codeSectNLffn] . sect i onNumber = codeSectNun; 

if (dataSectNum >= 0) 

loaderHdr [dataSectNum]. sect i onNumber = dataSectNum- 



return 0; 
35 > /* Fi LlHeaders () */ 

int UriteFile < byte *outFileName ) 
< 

4 0 short refNLfn; 

I ong i ; 
int err; 
Str255 pStr; 

^5 err = MakePStr ( outFileName, strlen ( (char*) outFileName ), pStr, sizeof pStr ); 

if (err) return err; 

err = HCreate ( 0, 0, pStr, kDefCreator, kDefType ); 
if (err < 0) C 
50 #ifdef MAC 

if (err != dupFNErr) < /* duplicates are ok! ? */ 
#endif 

printf (»# %s: Error creating %s %d\n", gTooLName, outFileName, err); 
return -1; 

5 5 tfifdef MAC 

> 

#endi f 
> 



err = HOpenDF ( 0. 0, pStr, fsRdWrPerm, & refNum ); 
if (err < 0) C 

printf ("U %s: Error opening %s %d\n", gToolName, outFileName, err); 
return -1; 

> 
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// File Header 

Write ( refNum, (byte *) & fileHdr, sizeof fileHdr ); 
// Section Headers 

for (i s: 0; i < fileHdr. number Sec t i ons; i-^+) 
C 

Write ( refNum, (byte *) & sections Ci], sizeof ( Sect i onHeader ) ); 



10 



// Global String Table 
if ( optGlobalStringTab ) 
C 

Write ( refNum, strings, sizeof strings ); 
15 > ' 



20 



45 



50 



// Loader Section 

my_SetFPos ( refNum, fsFromStart, sections [ loader Sec tNum] , containerOf f set ); 
Write ( refNum, (byte *) & IdHdr, sizeof IdHdr ); 



// Loader RelExp Headers 

Write ( refNum, (byte *) importFiles, IdHdr. numlmportf i I es * sizeof 
( Loader ImportF i lelD) ); 

2^ Write ( refNum, (byte *) importSymbols, IdHdr. numlmportSyms * sizeof (Loader import ) ); 

for (i = kReExportlrrport; i < kMaxLoadableSect ions; < 

if ( loaderHdr [i3. numRelocations ) // \\ loaderHdr [i]. numExports ) 

Write ( refNum, (byte *) & loaderHdr [i], sizeof (LoaderRelExpHeader ) ); 

> 

Write C refNum, (byte *) relocations, relWordCnt * sizeof (Reloclnstr) ); 
Write ( refNum, (byte *) stringTable, stringTabS i ze ); 
35 my_SetFPos ( refNum, fsFromStart, sections [loaderSectNun] . contai nerOf f set + 

IdHdr, hashSlotTable ); 

// to skip any padding after string table 

Write ( refNum, (byte *) exportHashSlot, (1 « IdHdr. hashSlotTabSi ze) * sizeof 
40 (HashSlotEntry) ); 

^ Write ( refNum, (byte *) sortedHashCha in, IdHdr. numExportSyms * sizeof (HashChainEntry) 

Write ( refNum, (byte *) s or tedExport Symbols, IdHdr. numExportSyms * SI 2E0F_LoaderExport 



// Code Section 

if (codeSectNum >= 0) C 

my_SetFPos ( refNum, fsFromStart, sections [codeSectNum] . containerOf f set ); 

Write ( refNum, codeStart, codeLen ); 

> 



// Data Section 
55 if (dataSectNum >= 0) C 

my_SetFPos ( refNum, fsFromStart, sections CdataSectNuml . containerOf fset ); 
if ( gPackedData ) 

Write ( refNum, packedOataStart , packedDataLen ); 
else 

^0 Write ( refNum, dataStart, dataLen ); 

if (optBSSExpand) 
i 

Write ( refNum, bssStart, bssLen ); 

65 
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25 



35 



50 



60 



// End of file padding 

if ( lastFileAddr != Align ( I est F i I eAddr , ALIGNMENT ) ) { 

iTTy_SetFPos ( refNum, fsFromStart, Align ( LastFileAddr, ALIGNMENT ) - 1 ); 
Write ( refNum, (byte *) 1 ); 

} 



#if MAC 

SetEOF ( refNum, Align ( lastFileAddr, ALIGNMENT ) ); 
iHi f 



10 #endif 

FSClose C refNum ): 



15 /* 

if creator/f i letype were specified, set 'em now.. 

*/ 

if (gFi leCreator |[ gFileType) 
( 

2 0 #if MAC 

Str255 pStr; 



err = MakePStr ( outFileName, strLen ( (char*) outFileName ), pStr, sizeof pStr ); 
if (err) return err; 



err = HGetFInfo ( 0, 0, pStr, & gFinderlnfo ); 
if ( err ) < 

printf ("it Xs: cannot get file type and creator info on Xs (%d)\n", gToolName, 
outFileName, err); 
30 return 1 : 

> 



if ( gFi leCreator ) 

gFinderlnfo. fdCreator = gFi leCreator; 

if ( gFileType ) 

gFinderlnfo. fdType = gFileType; 



err = HSetFInfo ( 0, 0, pStr, & gFinderlnfo ); 
40 if ( err ) C 

printf ("# %s: cannot set file type and creator info on %s (%d)\n", gToolName, 
outFileName, err); 
return 1; 

> 

4 5 #else 

printf ("# %s: cannot set file type and creator on this system\n". 



gToolName); 
#endif 
> 

return 0; 
> /* WriteFile () */ 



5 5 byte *SkipCount ( byte *thisChar, int liblndex, int offset, int *length ) 
< 

int skipCount; 



for (SkipCount = 0; skipCount < (3 + liblndex*3 + offset); skipCountf+) < 
while (*thisChar != 0) thisChar++; /* Skip the current string */ 
thi sChar-t-^; /* Get to the start of the next string */ 
> 

♦length = strlen ( (char*) thisChar ); 
return thisChar; 
6 5 > /* SkipCount () */ 
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45 



byte *GecLibNafne ( byte *thisChar, int libindex, int *tength ) 

static byte name -[256]; 
byte *f i leM, *membN; 
int fLen, mLen; 

fileN ' SkipCount C thisChar, liblndex, 1, & fLen ); 
membN = SkipCount C thisChar, tiblndex, 2, & mLen ); 

if (! mLen) C 

*length = fLen; 
return fileN; 

> 

movebytes C fileN, name, fLen ); 
name [fLen] = ' : ' ; 

movebytes ( membN, name* f L en* 1 , mLen ); 
name Cf Len+1+mLenl = 0; 

•length = fLen + 1 + mLen; 
return name; 
> /* GetLibName () */ 



10 



15 



20 



25 

GetName ( TLdSymPtr sym, byte **name, int •nameLength ) 
i 

int I en; 

30 if ( sym -> IsZeroes != 0) < 

byte *ptr = Cbyte *) sym -> IsName; 
*name = ptr; 
I en = 0; 

while (len < 8 && *ptr) < ptr++; lerf^-+; > 

35 > 

else C 

*name = IdStrings + sym -> IsOffset; 
len = St r I en ( (char*) *name ); 

> 



*nameLength = len; 

return 0; 
> /* GetMame () */ 



SizeString < int length. Boolean needNull ) 
C 

stringTabSi ze += length; 
50 if ( needNull || optNullPad ) 

stringTabSize-^+; 

> /* SizeString () */ 

55 AddString ( byte *name, int length. Boolean needNull ) 
C 

movebytes C name, stringTable stringTabOf f , length ); 
stringTabOff length; 
if ( needNull ;| optNullPad ) 
SO StringTabOf f++; 

> /* AddString (> */ 



long MapSecNumToPef f C long secNm, long *defAddr ) 
65 ( 
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i f (secNm < 0) < 
*defAddr = 0; 
return secMm; 

> 

5 

if (secNm == secNum CCODE INDEX]) C 
*defAddr = codeVAddr; 
return codeSeccNum; 

) 

10 if (secNm == secNum [DATA INDEX]) ( 

*defAddr = dataVAddr; 
return dataSectMum; 

) 

if (secNm == secNum tBSSINDEX]) < 
15 // the next line should result in subtracting out the start of bss, 

// and adding in the length of data to the symbol's address, 
// however, since XCOFF always places bssVAddr = dataVAddr * dataLen; 
// this line could be reduced to *defAddr = dataVAddr; 
*defAddr = bssVAddr - dataLen; 
2 0 return dataSectNum; 

> 

return -1; 
) /* MapSecNunToPef f C) */ 



25 



40 



45 



60 



int ClassOf ( TLdSymPtr sym ) 

<: 

switch < sym -> IsSmClass ) C 



case XMC_PR : return kCodeSyirbot ; 

3 0 case XMC_SV : /* fall thru */ 

case XMC_DS : return kTVectSynijol ; 

case XHC_TC : return kTOCSymbol; 

default : return kDataSymbol ; 

> 

35 } /* ClassOf () */ 



Boolean I sVa I idlmport ( TLdSymPtr sym ) 
i 

/* 

don' t remove import symbols - 

if ( sym remSym CO] ) return 0; 

if ( sym == remSym CI] ) return 0; 

if ( sym == remSym [2] ) return 0; 



return IsLdlmport ( *sym ) 

sym - > I s I F i I e > 0 && 



sym -> IsIFile <= IdHdr. nLinlmportF i les; 
50 > /* IsValidlmport () •/ 



Boolean I sReExport Import ( TLdSyTT#>tr sym ) 

5 5 // remove export symbols only 

if ( sym == remSym CO] ) return 0; 

if ( sym == remSym [1] ) return 0; 

if ( sym == remSym C2] ) return 0; 



return IsLdExport ( *sym ); 
> /* IsReExportlmport <) */ 



Boolean I sVal idExport ( TLdSymPtr sym ) 
65 < 
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// remove export symbols only 
if ( sym -= remSym CO} ) return 0; 
if ( sym == remSym CD ) return 0; 
^ if ( sym == remSym C2] ) return 0; 

return IsLdExport ( *sym ) && 

(sym -> IsScNum == secMum CCODEIMDEX) 
sym -> IsScNl»t» == secNum CDATAINOEX] || 
sym -> IsScNum == secNum CBSSINDEX]); 
10 > /* IsValidExport () */ 

AddExport ( long seen, long theAddr, byte •name, int nameLength, int symClass ) 
C 

15 exportSymbols [exportNext] . symClass = symClass; 

exportSymbols CexportNext] . nameOffset = stringTabOf f ; 
exportSymbols CexportNext]. sect lonNunber = seen; 
// loaderHdr Csecn] . num£xports++; 

exportHashChain CexportNext]. hashword = CFLHash ( name, namelength ); 
20 exportSymbols CexportNext]. address = theAddr; 

exportNext**; 
> /* AddExport () */ 

2 5 Addlmport ( int XCOFF index, TLdSymPtr sym, byte *name, int namelength ) 

i 

int symClass = ClassOf ( sym ); 

XCOFFSymindexToPef f ImportNuTt>er CXCOFFindex} = importNext; 
30 importSymbols CimportNext] . symClass = symClass; 

inportSymbols CimportNext] . nameOffset = str ingTabOf f ; 
// if both import & export, add to both 
// tables, but only one string table entry 
if ( I sReExport Import ( sym ) ) 

3 5 AddExport ( kReExport Import , importNext, name, namelength, symClass ); 

importNext++; 
) /* Addlmport ( ) */ 

4 0 int SymCvt C TLdSymPtr IdSyms, long symCnt ) 

( 

TVersion linkedVer, oldlmpVer; 
TBooLean iB; 
long file Index; 
45 long sym Index; 

byte *name; 
int nameLength; 
byte *ldFiles; 
long size; 

5 0 TLdSymPtr sym; 

long cnt; 
long seen; 
long defAddr; 
long sNum; 
5 5 TBoolean islmp; 

// set inital values to these 

if (fileCnt ==: 0 [ | auxHdrPtr -> ahEntry == -1) C 
oO IdHdr. entryPointSect ion = kNoRegion; 

> 

else { 

IdHdr, entryPointSect ion = MapSecNumToPef f (auxHdrPtr -> ahSnEntry, & defAddr); 
IdHdr. entryPointOf f set = auxHdrPtr -> ahEntry - defAddr; 

o5 > 
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IdHdr. ini tPointSection = kMoRegion; 
IdHdr. termPointSect ion = JcNoRegion; 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



// determine sizeof string table 
// first - size of import lib names 

if (IdHdrPtr) i 

/* Skip the first file string, it is the UNIX path */ 
IdHdr, numlmportFi les = IdHdrPtr -> IhNImpID - 1; 
IdFiles = (byte *) IdHdrPtr + IdHdrPtr -> IhlmpOff- 

> 

for (filelndex = 0; filelndex < IdHdr. numlmportF i les; filelndex++) C 
name = GetLibName ( IdFiLes, f i leindex , & nameLength ); 
CheckLibOpt ( & name, & nameLength, & tinkedVer, & oldlnpVer, & iB ); 
SizeString ( nameLength, TRUE ); 

> 



// next size of all symbol's names 

// and types of them - import vs. export 

for (symlndex = 0; symlndex < symCnt; symlndex++) ( 

sym = & IdSyms [symlndex]; 

GetName ( sym, & name, & nameLength ); 

// identify entryPoint, initPoint, termPoint, by name 
// as user specified options, and remove from general 
// processing of symbols. 

isimp = IsVal idlmport ( sym ); 

if (islmp && I sReExport Import ( sym ) |; I sVa I i dExport ( sym ) ) < 
if ( nameLength == ent ryPoi ntNameLength && 

comparebytes ( name, entryPoint, nameLength ) ) ( 
if ( islmp ) i 

IdHdr. entryPointSection = kReExport Irrport ; 
entryPointOf fset = sym -> LsValue; 



entryPointSection = MapSecNumToPef f 
entryPointOf fset = sym -> IsValue - 



IdHdr 

> 

else C 
IdHdr 
IdHdr 

> 

if (! optKeepMIT) remSym [0] = sym; 
entryUsed = TRUE; 
> 

if ( nameLength ini tPointNameLength && 

comparebytes ( name, initPoint, nameLength ) 
if ( islmp ) i 

IdHdr. initPointSection = kReExportlrport; 
IdHdr. ini tPointOf fset = sym -> IsValue- 

> 

else < 
IdHdr. 
IdHdr. 

) 

if (! optKeepMIT) remSym CI] = sym, 
initUsed = TRUE; 



C sym -> IsScMum, & defAddr )• 
defAddr; 



) i 



initPointSection = MapSecNumToPef f ( sym -> IsScNum, & defAddr )- 
ini tPointOf fset = sym -> IsValue - defAddr- 



if < nameLength s= termPointNameLength && 

comparebytes ( name, termPoint, nameLength ) ) C 
if ( i s I mp ) i 

IdHdr. termPointSection = kReExportlmport; 
IdHdr. termPointOf f set = sym -> IsValue-' 

) 

else <: 
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IdHdr. termPointSect ion = MapSecNumToPef f ( sym -> IsScNum, & defAddr ); 
IdHdr. termPointOf f set = sym -> IsValue - defAddr; 



if (i optKeepMIT) remSym C2] = sym; 
terrrtJsed = TRUE; 
> 

> 



10 



15 



20 



if ( IsValidlmport ( sym ) ) C 
IdHdr. ntjmlmportSyms++; 
SizeString ( nameLength, TRUE ); 
if ( I sReExport Import ( sym )) 
IdHdr. num£xportSyms++; 

else if ( IsVal idExport ( sym > ) < 
IdHdr. numExportSyms**; 
SizeString ( nameLength, FALSE ); 

> 



25 



30 



if < entryPoint && ! entryUsed ) 

printf ('•# %sz warning, the specified entry point '%s' was not found\n", gToolName 
entryPoint); 

if ( initPoint && ! initUsed ) 

printf Xs: warning, the specified init routine '%s' was not found\n", gloolName 
initPoint); ' 
if ( termPoint && ! ternllsed ) 

printf ("# y.s: warning, the specified term routine '%s' was not found\n", gToolName 
ternPoint); ' 



35 



for Csymlndex = 0; symlndex < addExpCnt; symlndex++) C 

SizeString ( strlen ( (char*) addExpAry tsymlndex] . name ), FALSE ); 
IdHdr. ntnexportSyms++; 

> 



40 



45 



50 



55 



60 
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// allocate symbol string tables and import / export tables 
stringTable = Malloc ( (size = str i ngTabSi ze) + 4 ); 
if (! StringTable) goto Mem_Error; 
clearbytes ( stringTable, size ); 
stringTabOff = 0; 

XCOFFSymlndexToPef f ImportNunber = (long *) Malloc ( size = sizeof (long) * symCnt ); 
^f (! XCOFFSymlndexToPef f ImportNumber) goto Mem_Error; 
clearbytes ( (byte •) XCOFFSymlndexToPef f ImportNumber, size ); 

importFiles = (Loaderimport F i lelOPtr ) 

Malloc ( size = sizeof (Loader ImportFi lelD) * IdHdr. nunlmportF i I es ); 
if (! importFiles) goto Hem_Error; 
clearbytes ( (byte *) importFiles, size ); 

importSyinbols = (LoaderlmportPtr ) 

Malloc ( size = sizeof (Loaderimport) * IdHdr. nLsnlmportSyms ); 
if {! importSymbols) goto Mem_Error; 
clearbytes ( (byte *) importSymbols, size ); 
importNext - 0; 

exportSymbols = ( LoaderExportPtr ) 

Malloc ( size = sizeof (LoaderExport ) * IdHdr. numExportSyms ); 
if (! exportSymbols) goto Mem_Error; 
clearbytes ( (byte *) exportSymbols, size ); 
exportNext = 0; 
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exportHashChain = ( HashChai nEntryPt r ) 

Malloc ( size = sizeof (HashChainEntry) * IdHdr. numExportSyms ); 
if (• exportHashChain) goto Mem_Error; 
5 clearbytes ( (byte *) exportHashChain, size ); 

// extract imported library symbols 

for Cfilelndex = 0; filelndex < IdHdr. numlmportFi les; f i lelndex++) ( 
name = GetLibName ( IdFiles, filelndex, & nameLength ); 
10 CheckLibOpr ( & name, & nameLength, & linkedVer, & oldlmpVer, & is ); 

importFiles [filelndex]. f i leMameOf f set = str ingTabOf f ; 
importPiles [filelndex]. I i nkedVers ion = linkedVer; 
importFiles [filelndex). oldlmpVers i on = oldlmpVer; 
importFiles [filelndex]. initBefore= i8; 
15 AddString ( name, nameLength, TRUE ); 

) 



// extract imports for each library 
2 0 for (filelndex = 0; filelndex < IdHdr. numlmportFi les; filelndex++) i 

// add imported symbols, in order of the files they belong to. 
cnt =0; 

importFiles [filelndex], impFirst = importNext; 
fof (symlndex = 0; symlndex < symCnt; symlndex++) < 
2 5 sym = & IdSyms [symlndex]; 

if (! IsVal idlmport ( sym )) continue; 
if (sym -> lsIFile-1 >= filelndex) continue; 
GetName ( sym, & name, & nameLength ); 
Addlmport ( symlndex, sym, name, nameLength ); 
0 AddString ( name, nameLength, TRUE ); 

cnt++; 

> 

importFiles [filelndex]. nunlmporis = cnt; 



35 



40 



45 
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// extract exports by section number 

for (sNun = kReExport Import ; sNum < kHaxLoadableSect i ons; sNuny*-+) < 

if (sNum != kReExportlmport) // we already started accumulating exports of type -3 

; // loaderHdr [sNum] . expFirst = exportNext; 
for (symlndex ~ 0; symlndex < symCnt; symindex++) ( 
sym = & IdSyms [symlndex]; 

if ( IsVal idlmport ( sym ) ) continue; // if it was an import it was checked 



a I ready 



// to see if it is an export 

if i\ IsVal idExport ( sym ) ) continue; 
seen = MapSecNunToPef f ( sym -> LsScNum, & defAddr ); 
if ( seen != sNum ) continue; 
GetName ( sym, & name, & nameLength ); 

AddExport ( seen, sym -> IsValue - defAddr, name, nameLength, ClassOf ( sym ) )• 
AddString ( name, nameLength, FALSE ); * 
> 

for (symlndex = 0; symlndex < addExpCnt; syml ndex++) ( 
long xc index; 

5 5 AddExpType *add = & addExpAry [symlndex]; 

switch ( add -> format ) < 
case kEqui valence : 
xc Index = F i ndXCOFF I ndexByName ( add -> oldname, IdSyms, symCnt )• 
if ( xclndex < 0 ) C 

printf ("# %s: symbol '7.s' not found for export "/.s'\n", gToolName add -> 
oldname, add -> name); ' 
return 1; 

> 

sym = & IdSyms [ xclndex ]; 

nameLength = str I en ( (char*) add -> name ); 



65 
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if ( IsVal idlmport ( sym ) ) C 
long imp Index; 

if ( kReExport Import != sNum ) break; 

implndex = XCOF F Sym I ndexToPeff Import Number C xclndex ]; 
5 AddExport ( kReExport I mport , implndex, add -> name, nameLength, CLassOf ( 

sym ) ); 

> 

else ( 

seen = MapSecNumToPef f ( sym -> IsScNum, & defAddr ); 
10 if ( seen != sNum ) break; 

AddExport ( seen, sym -> IsValue - defAddr, add -> name, nameLength, ClassOf 

( sym ) ); 
> 

AddString ( add -> name, nameLength, FALSE ); 

15 break; 

case kAbsolute : 

if ( add -> seen != sNum ) break; 
nameLength = strlen ( (char*) add -> name >; 
2Q ^ AddExport ( add -> seen, add -> offset, add -> name, nameLength, add -> class 

AddString ( add -> name, nameLength, FALSE ); 

break; 

> 

25 

CheckWeak (); 
return 0; 

30 

Mem_Error: 

printf ("# %s: Could not allocate %d bytes\n", gToolName, size); 
return 1 ; 
3 5 > /* symCvt () */ 



int SymSort ( void ) 
i 

4 0 long i; 

long s; 

long *exportLinkSlot; 
long hashStotSize; 
long size; 
4 5 long oldlndex; 

long hashSlot; 

long oldcc; 

int hsiner = 0; 

50 // compute size of hash slot table 

IdHdr. hashSlotTabSize = CFLHashSlotBi ts < IdHdr. numExportSyms ); 
top : 

hashSlotSize = 1 « IdHdr. hashSlotTabSize; 

55 exportHashSlot = (HashSlotEntryPtr ) 

Halloc ( size = sizeof (HashSlotEntry) * hashSlotSize ); 
if (* exportHashSlot) goto Mem_Error; 
clearbytes ( (byte *) exportHashSlot, size ); 

60 exportLinkSlot = (long *) 

Malloc ( size = sizeof (long) * IdHdr. numExportSyms ); 
if (! exportLinkSlot) goto Mem_Error; 
clearbytes ( (byte *) exportLinkSlot, size ); 



65 



for ( ) = 0; i < IdHdr. numExportSyms; i++) < 
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hashSlot = CFLHashSLot ( exportHashChatn Cil. hashword, IdHdr. hashS I otTabS i ze ); 
exportLinkSlot [i] = exportHashSLot ChashSlotl. chainlndex; 
oldcc = exportHashSlot [hashSlot]. chainCount; 
exportHashSlot. [hashSlot] . chainCount++; 
exportHashSlot ChashSlotl . chainlndex = i ; 
if (exportHashSlot [hashSlot], chainCount »= oldcc + 1) ( 
if ( ■^+hsincr > 3) ( 

printf C"# /Is: hash chain count overflow (qui 1 1 i ng)\n*\ gTool Name) ; 

return -1; 



printf ("# %s: Hash chain count overflow, using more slotsNn", gToolName); 
IdHdr. hashSlotTabSize++; 
Free ( (byte *) exportHashSlot ); 
Free ( (byte *) exportLinkSlot ); 
15 goto top; 

) 

i f < exportHashSlot [hashSlot] . chainlndex » = i ) < 

printf ("# V.S: hash slot value overflow (too many exports, qui tting)\n'*, 
gToolName); 

2 0 return -1; 

> 

> 

sortedExportSymbols = (LoaderExportPtr) 
25 Malloc ( size = sizeof (LoaderExport ) * IdHdr. ni^rtxportSyms ); 

if (! sortedExportSymbols) goto Mem_Error; 
clearbytes ( (byte *) sortedExportSymbols, size ); 

sortedHashChain = (HashChainEntryPtr) 

3 0 Malloc ( size = sizeof (HashChainEntry) * IdHdr. ntrfxportSyms ); 

if (! sortedHashChain) goto Mem_Error; 
clearbytes ( (byte *) sortedHashChain, size ); 

sortedExportNext = 0; 
3 5 // convert exportHashSlot in place, 

// put map of old new into indexMap 
for (s = 0; s < hashSlotSi ze; s++) < 

oldlndex = exportHashSlot [s] . chainlndex; 

exportHashSlot [s] . chainlndex = sortedExportNext; 

40 

for (i = 0; i < exportHashSlot [s] . chainCount; i+^-) { 

sortedExportSymbols [sortedExportNext] = exportSymbol s [oldlndex] ; 
sortedHashChain [sortedExportNext] = exportHashChai n [oldlndex]; 

sortedExportNext-»-+; 
45 oldlndex = exportLinkSlot [oldlndex]; 



Free ( (byte *) exportLinkSlot ); /* Mwow, we almost care about memory disposal!' 
5 0 return 0; 

Hem_Error : 

printf ("# %s: Could not allocate %d bytes\n". gToolName, size); 
return 1; 
5 5 ) /* SymSort () */ 



int AddWeaklmport ( byte *libName, byte *name ) 
C 

6 0 Weaklmport *ptr = (Weaklmport *) NewPtr ( sizeof (Weaklmport) ); 

if ( ! ptr) return -1 ; 

ptr -> libName = libName; 

ptr -> name = name; 

ptr -> next = Weak ImportL i st ; 
65 UeaklmportList = ptr; 
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rerurn 0; 
) /* AddWeak Import () */ 



b int CheckWeak ( void ) 

{ 

Weak import *ptr; 
long start, end; 
L ong f i I e 1 ndex ; 
10 int libNameLen; 

int impNameLen; 



15 



for (ptr = UeaklmportL i St; ptr; ptr = ptr -> next) ( 
start = end = 0; 

libNameLen = strlen ( (char *) ptr -> libName ); 
for ( fi lei ndex = 0; fi lei ndex < IdHdr. nunlmportF i les; filelndex*+ ) C 



impNameLen - strlen ( (char *) (stringTable + importFiles Cfilelndexl. 
2 0 f 1 leNameOf fset) ); 

if ( impNameLen != libNameLen [[ 

! comparebytes ( stringTable* importFiles [f i lelndex] . f i leNameOf f set , 
ptr -> libName, 
libNameLen ) ) continue; 

25 

start = importFiles [fi lelndex]. impFirst; 

end = start + importFiles Cfi lelndex]. numlmports; 

for (; start < end; start++ ) C 

if ( strcmp ( (char*) (stringTable + importSyn^ols [start]. nameOffset), 
30 (char*) ptr -> name) »= 0 ) 

cont inue; 

importSymbols [start]. symClass [= kUndeflmpOK; 
// we found the syirbol whose name matches, so stop! 
break; 

35 > 

// we found the file whose library name matches, so stop! 
break; 

> 

40 if (start >= end) 

printf ('•# %s: warning, symbol '%s' in file 'Xs' not found\n", gToolName, ptr -> 
name, ptr -> libName); 
> 

4 5 for ( fi lelndex = 0; fi lelndex < IdHdr. numlmportFi les; f i I e I ndex** ) < 

if ( importFiles [fi lelndex]. initBefore & kWeaklmport ) 
ChecklmportsForUndefOK C fi lelndex ); 

> 

5 0 return 0; 

> /* CheckWeak () */ 

void ChecklmportsForUndefOK ( int fi lelndex ) 
55 i 

long start, end; 

start = importFiles [fi lelndex], impFirst; 

end = start * importFiles [fi lelndex]. numlmports; 

6 0 for (; start < end; start** ) < 

if ( importSymbols [start], symClass & kUndeflmpOK ) continue; 

printf ("# %&i warning, symbol '%s' is not marked as weak, but its library \"%s\" 
is.\n", 

gToolName, 

o5 StringTable * importSymbols [start]. nameOffset, 
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stringTabte + importF i les Cf i lelndex] . f i leNameOf f set ) ; 

> 

) /* ChecklfiportsForUndefOK () */ 



long RelocAddr ( TLdRelPtr reloc ) 
{ 

long relpos = (long) reloc -> IrVAddr; 

10 if (reloc -> IrRSecNm == secNum [DATAIMOEX]) ( 

relpos -= dataVAddr; 

> 

else if (reloc -> IrRSecNm secNum [BSSINDEX]) ( 
relpos -= bssVAddr; 
15 relpos += dataLen; 



20 



25 



30 



return relpos; 
) /* RetocAddr () */ 



long Reloclndex ( TLdRelPtr reloc ) 
{ 

long rindex = reloc -> IrSymNdx; 

if (rindex == BSSINDEX) 
rindex = OATAINDEX; 
return rindex; 
) /* Reloclndex () */ 



int DataAdjust ( TLdRelPtr reloc. long count ) 
< 

whi le (count > 0) < 

3 5 long rindex = reloc -> IrSymNdx; 

long *rAddr; 

if (reloc -> IrRSecNm != secNum [DATAINDEX] && 
reloc -> IrRSecNm 1= secNum tBSSINDEX]) C 

4 0 printf ("a Xst not converting relocation; relocations not allowed in code\n", 

gToolName); 

return 1; 

> 

4 5 if (rindex =- CODE INDEX) { 

if (reloc -> IrRSecNm == secNum [BSSINDEX]) { 
if (! optBSSExpand) ( 

errmsg : printf ("# %s: relocation of BSS to BSS not convertable into PEF format without 
-b option\n", gToolName); 

5 0 printf ("# item at OxXOSlx in BS$\n'\ reloc -> IrVAddr); 

printf (*'# relocate item into (initialized) DATA to properly 

convert\n"); 

return 1 ; 

> 

55 rAddr = (long *) (bssStart + RelocAddr ( reloc ) - dataLen); 

> 

else < 

rAddr = (long *) (dataStart RelocAddr ( reloc ) ); 

> 

60 *rAddr -= codeVAddr; 

*rAddr +- peff CodeVAddr; 

> 

else if (rindex == DATAINDEX) < 

if (reloc -> IrRSecNm == secNum [BSSINDEX]) ( 

6 5 if (! OptBSSExpand) goto errmsg; 
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rAddr = (long *) (bssStart RelocAddr ( retoc ) - dataLen)- 

> 

else ( 

rAddr = (long *) (dataStart + RelocAddr ( reloc ) )• 
5 ) * 

* rAddr -= dataVAddr; 

* rAddr += pef f DataVAddr; 

) 

else if Crindex == BSSINDEX) ( 
10 r Index = DATA INDEX ; 

if (reloc -> lrRSccNm== secNum CBSSINDEXJ) i 
if (! optBSSExpand) goto errmsg; 

rAddr = (long ♦) (bssStart + RelocAddr ( reloc > - dataLen); 

15 else ( 

rAddr = (long *) (dataStart + RelocAddr ( reloc > )• 

> 

*rAddr -= bssVAddr; 
* rAddr ■»-= dataLen; 
2 0 *rAddr pef fOataVAddr; 

> 

count--; 
reloc++; 

> 

2 5 return 0; 

> /• DataAdjust () */ 



30 



long index = 0; 



^define IcRptBufSize 8 // keep power of 2 

#define kRptBufMask (kRptBuf Size- 1 ) 
typedef Reloclnstr TRptBuf CkRptBuf Si ze] ; 
TRptBuf rptBuf; // last n words of relocation 

3 5 int rptOist; // match found how far back? 

int rptRest; // riLinber of instrs in current partial match 

int rptRunCnt; // number of complete matches 



40 void PrtReloc ( Reloclnstr w1 ) 
< 

relocations [relWordCnt++] = w1; 
> /♦ PrtReloc () */ 



45 



CheckRelocMatch ( Reloclnstr w1 , int matchStart ) 



int i; 

for (i = matchStart; i <= kRptBuf Size; i+*) i 
50 if (wl == rptBuf C relWordCnt-i & kRptBufMask ]) C 

rptOist = i; 

rptRest =1; // first word of match 

break; 

55 

> /* CheckRelocMatch C) */ 



void RestartMatch ( Reloclnstr wl ) 
60 < 

int re I Copy = relWordCnt; 

int rptRunCopy = rptRunCnt; 

int rptRestCopy = rptRest; 
^ int rptOistCopy = rptDist; 

o5 TRptBuf rptCopy; 
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// don' t etni t a RPT 1 , \ 
if (rptRunCnt == 1 && rptOist == 1) { 
rptRunCopy = rptRunCnt = 0; 
5 rptRestCopy = rptRest = 1; 

> 

// output completed runs; set rptRunCnt to 0 
if (rptRunCnt > 0) < 
10 if (rptRunCnt <= ksRPTMAX) < 

PrtReloc ( (krRPT « RELOPSHFT) + (rptDist- 1<<8) * rptRunCnt - 1 ); 

> 

else <: 

PrtReloc ( (krLRPT << RELOPSHFT) + ( rptD i St - 1 <<6) + (rptRunCnt » 16) ); 
15 PrtReloc ( rptRunCnt & 65535 ); 

> 

) 

// in case of an incomplete run, which we will want to add to thru 
2 0 // repeat buffer, save the repeat buffer to know what the partial 

// sequence was 

movebytes ( (byte *) rptBuf, (byte *) rptCopy, sizeof rptCopy ); 

// if a run was sent, then clear the match buffer 

2 5 // otherwise, don't 

if (rptRunCopy) < 

clearbytes ( (byte *) rptBuf, sizeof rptBuf ); 

rptDistCopy =1; // and set the distance of first ok match to 1 

// (no match will be found, however, AddReloc 

3 0 //will know that a sequence is being restarted 

// when it sees that matchStart > 1) 

) 

relCopy -= rptDist; 

3 5 rptRunCnt = 0; 

rptDist := 0; 
rptRest = 0; 

while (--rptRestCopy >= 0) C // output incomplete match 

AddReloc ( rptCopy C retCopy & kRptBufMask 3, rptD i stCopy+1 ); 

4 0 rptDistCopy = 0; 

relCGpy^-+; 

> 

AddReloc ( wT , 1 ); 
> /* RestartHatch () */ 



45 



50 



void AddReloc ( Reloclnstr w1, int matchStart ) 
i 

static int secondWord; 



if (rptDist) C //is repeat buffer started? 

// yes- check if next word continues repeat 
if (w1 && wl == rptBuf [ relWordCnt-rptDist+rptRest & kRptBufMask ]) C 

rptRest++; // yes- check if match complete? 

5 5 if (rptRest == rptDist) C 

rptRunCnt ++; // yes- incremnt run count 

rptRest = 0; // clear count of uncompleted run 

> 

> 

60 else ( // no- match continuation failed 

RestartMatch ( w1 ); 

> 

return; 
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// case of no partial match started yet 

^ //no - check for match anywhere 

If (matchStart > 1) secondUord = 0; // sequence is being restarted by Restart 
5 boundary " restarted sequence must start on instr 

// because match started on instr boundary 
If (! secondWord && u1) // only check if start of instr 

CheckRelocMatch ( wl, matchStart ); 

(rptOist) < // matched somewhere? 

-^^ (rptRest == rptDist) C // yes- is match complete^ 

rptRunCnt-^*; // yes- increment run cnt 

^ rptRest = 0; // set rest cnt to 0 

> 

^^^^ if (w1) < //no - add to match buffer, output 

rptBuf [ relWordCnt & kRptBufHask] w1; 
Prtfteloc ( w1 ); 

secondWord = ( secondWord ? 0 : IsLARG ( wl » RELOPSHFT > 

> 

20 ) 

) /* AddReloc () */ 

void dorform ( int op, long run, long max > 
25 C 

int cnt; 



30 



35 



40 



50 



wh i I e ( run > 0 ) ( 

cnt = run > max ? max : run; 

AddReloc ( (op « RELOPSHFT) + cnt - 1, 1 ); 
run -= cnt; 

) 

> /* dorform () */ 



void doform ( int op, long cnt ) 

<: 

AddReloc ( (op « RELOPSHFT) + cnt, 1 ); 
> /* doform () */ 



void dolform ( int op, long offset ) 

AddReloc ( (op « RELOPSHFT) + ((offset >> 16) & 1023), 1 )- 
AddReloc ( offset & 65535, 1 ); 
> /* dolform () */ 



void dodform ( int op, int delta, int run ) 
( 

AddReloc ( <op « RELOPSHFT) ♦ (delta « 4) + run 1 )• 
> /* dodform () */ ' ' 



5 5 void dodelta ( long relpos, long delta ) 

if (! delta) return; 
if (delta >= 0 && delta <= ksDELTA) 
doform { krDELT, detta-1 ); 
60 else 

dolform ( krLABS, relpos ); 
> /* dodelta () */ 



65 int RelCvt ( TLdRelPtr reLoc, long count ) 
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long curpos = 0; 

long relpos; 

long delta; 

long run; 

long r Index; 

long symlndex = 0; 

long peff Index; 



90 



10 DataAdjust ( reloc, count ); 



15 



relocations = (Reloclnstr *) Malloc ( count * sizeof (Reloclnstr) * 2 * 2 + 8 ); 
// max possible size of relocation: old count * 2 bytes per reloclnstr 
// * 2 for long form of retoc instr * 2 for two instrs 

if (! relocations) return -1; 



if ( count > 0 ) ( 

if (codeSectNLin >• 0) 
2 0 if ( codeSectNum != 0) 

doform ( krCDIS, codeSectNLfii ); 

if (dataSectNun >= 0) 

if ( dataSectNum != 1 ) 

2 5 doform ( krOTIS, dataSectNLn ); 

) 

whi le (count > 0) C 

relpos = RelocAddr ( reloc ); // address to relocate 

3 0 delta = relpos - curpos; // delta from last address 

curpos = relpos; // new curpos is addr to relocate 

run = 1 ; 

3 5 // Look for run of 

// SYM,syH+1 ,SYH+2. . . 

r Index = Reloc Index ( reloc ); 

40 if (rindex > BSSINDEX) < 

peff Index = XCOFFSyml ndexToPeff Import Number [rindex - kSpec i alSyinboLCount J ; 
if (peff Index == symlndex) ( 
whi le (count >= 2 && 

RelocAddr ( reloc+1 ) == curpos+4 && 

4 5 XCOFFSymlndexToPef f importNumber [Reloclndex ( reloc+1 ) - 

kSpec i a I Symbo I Count ] 

== symlndex-*-!) < 

curpos += 4; 
reloc++; 

5 0 symlndex++; 

rurr»-*; 
count-- ; 

> 

> 

55 

dodelta ( relpos, delta ); 

if (run > 1 II peff Index == symlndex) ( 

dorform ( krSYHR, run, ksISMAX ); 

symlndex++; 

60 > 

else C 

if (peff Index >= ksISHAX) 

dolform ( krLSYH, pefflndex ); 
else 

6 5 doform ( krSYMB, pefflndex ); 
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symlndex = pef f Index^l ; 

> 

curpos += 4; 
reloc-»-+; 
index run; 
count - - ; 
continue; 



XO // look for run of: 
// code, 
// data 

15 



20 



whi te (count >= 2 && 

RelocAddr ( reloc+1 ) == curpos+4 && 
Reloc Index ( reloc+1 ) == r Index) ( 
curpos += 4; 
reloc**; 
run++; 
count--; 

> 



// no single run found, 
// look for run of: 
2 5 // desc ( code/da ta/btank) 

// vtbUdata/btank) 



30 



if (run == 1 && rindex == 0) C // desc (12 byte form) 

whi le (count >= 2 && 



RelocAddr ( reloc ) == curpos && 
Reloclndex ( reloc ) == COOEINDEX && 
RelocAddr ( reloc+1 ) == curpos+A && 
Reloclndex ( reloc+1 ) == DATAINDEX && 
-.J- (count == 2 II RelocAddr ( reloc+2 ) >= curpos+12) ) C 

■jZ> curpos += 12; 

reloc += 2; 
run++ ; 
count -= 2; 

4 0 if (run > 1) C 

run--; 

dodelta ( relpos, delta ); 
dorform ( krOESC, run, ksDEMAX ); 
index += run*2; 
continue; 

> 

while (count >= 2 && // desc (8 byte form) 

RelocAddr ( reloc ) == curpos && 
Reloclndex ( reloc ) == COOEINDEX && 
RelocAddr ( reloc+1 ) == curpos+4 && 
Reloclndex ( reloc+1 ) s= DATAINDEX) { 
if (count > 2) < // try to exclude 12 byte form 

long addr = RelocAddr ( reloc+2 ); 
2^ ^ (addr < curpos+8 [| addr == curpos+12) break; 

curpos += 8; 
reloc += 2; 
run++ ; 
count -= 2; 

> 

if (run > 1) C 
run- - ; 

dodelta ( relpos, delta ); 
dorform ( krDSC2, run, ksDEMAX ); 
index += run*2; 



45 



50 



60 
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cont inue; 

> 

> 

else if (run =^ 1 && r Index == 1 ) ( // vtbl 

5 whi le (count >= H &S 

RelocAddr ( reloc+1 ) == curpos+8 && 
Re Loc Index ( reloc+1 ) == DATA INDEX && 
(count == 2 ; I RelocAddr ( reloc+2 ) >= curpos+16) ) < 
curpos += 8; 
10 reloc*+; 

run-t-* ; 
count- - ; 

> 

if (run > 1) { 

15 /* 

if (run <= ksDVRHAX) < 

if (delta > ksDVDMAX [; (delta & 3)> < 
dodelta ( relpos, delta ); 
delta = 0; 

20 > 

dodform ( JcrOVBL, delta << 2, run-1 )• 

) 

else 

*/ 

25 < 

dodelta ( relpos, delta ); 
dorform ( krVTBL, run, ksVTMAX ); 

curpos 8; 
ovJ reloc++; 

index += run; 
count--; 
cont i nue; 

> 

35 > 

if ( r Index == DATA INDEX && run <= ksDDRHAX ) C 
if (delta > ksOODMAX ;| (delta & 3)) < 
dodelta ( relpos, delta ); 
4 0 delta = 0; 

> 

dodform ( krDDAT, delta, run V 

> 

else C 

4 5 dodelta ( relpos, delta ); 

^ dorform ( rlndex == 0 ? krCODE : krDATA, run, ksDATA ); 

curpos += 4; 
reloc+'»■; 
5 0 index run; 

count--; 

> 

c-c ^^"^^ buffers - with the unmatchable and reserved instruction opcode 

33 AddReloc ( 0, 1 ); - 



60 



return 0; 
> /* RelCvt () */ 



int FindStuff ( byte *theContainer ) 
C 

fileHdrPtr = (TFi leHdrPtr)theContainer; 

auxHdrPtr = (TAuxHdrPtr)(f t leHdrPtr +'l); 

scnHdrs = (TScnHdrPtrXauxHdrPtr + 1);' 
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IdrScnHdr = & scnHdrs [auxHdrPtr -> ahSnLoader- 1 ] ; 

IdHdrPtr = (TLdHdrPtr) ( theContainer ^ LdrScnHdr -> 

IdSyms = (TLdSyntf>tr) (IdHdrPtr + 1); 

IdRelocs = ^TLdRelPtr) ( IdSyms + ldHdrPtr-> IhNSyms) , 

IdStrings = (byte *) IdHdrPtr + ldHdrPtr-> IhSTOf f ; 



shScnPtr); 



10 



secNum CO] = auxHdrPtr -> ahSnText; 
secNum CI] = auxHdrPtr -> ahSnOata; 
secNum C2] = auxHdrPtr -> ahSnBSS; 



codeScnHdr = & scnHdrs C secNLm CO] - 1 ] ; 

15 codeStart = theContainer + codeScnHdr -> shScnPtr; 

codeLen = codeScnHdr -> shSize; 

codeVAddr = (long) codeScnHdr -> shVAddr; 

dataScnHdr = & scnHdrs C secNum £1] - 1 3; 

2 0 dataStart = theContainer ■*■ dataScnHdr -> shScnPtr; 

dataLen = dataScnHdr -> shsize; 

dataVAddr = (long) dataScnHdr -> shVAddr; 

bssScnHdr = & scnHdrs t secNtm [2] - 1 ] ; 

2 5 bssLen = bssScnHdr -> shSize; 

bssVAddr = (long) bssScnHdr -> shVAddr; 
if (optBSSExpand) < 

bssStart = Malloc ( bssLen ); 

clearbytes ( bssStart, bssLen ); 

30 > 



35 



40 



45 



if 
C 



(auxHdrPtr -> ahSnText) 



codeScnHdr 
codeStart 
codeLen 
codeVAddr 

se 

codeScnHdr 
codeStart 
codeLen 
codeVAddr 



= & scnHdrs C secNun CO] - 1 ] ; 

= theContainer codeScnHdr -> shScnPtr; 

= codeScnHdr -> shSize; 

= (long) codeScnHdr •> shVAddr; 



= 0; 
= 0; 
^ 0; 
= OL; 



50 



55 



60 



if 



(auxHdrPtr -> ahSrOata) 



dataScnHdr 
dataStart 
dataLen 
dataVAddr 

se 

dataScnHdr 
dataStart 
dataLen 
dataVAddr 



= & scnHdrs C secNun C1] - 1 ]; 

= theContainer + dataScnHdr -> shScnPtr; 

= dataScnHdr -> shSize; 

= (long) dataScnHdr -> shVAddr; 



= 0; 
= 0; 
= 0; 
= OL; 



65 



if (auxHdrPtr 
( 

bssScnHdr 
bssLen 



-> ahSnSSS) 

= & scnHdrs C secNum C23 - 1 ]; 
= bssScnHdr -> shSize; 
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bssVAddr = (long) bssScnHdr -> shVAddr; 

if (optSSSExpand) 

< 

bssStart Mai toe ( bssLen ); 

5 ctearbytes ( bssStart, bssLen ); 

> 

else 
i 

10 bssScnHdr = 0; 

bssLen = 0; 

bssVAddr = OL; 

if (optBSSExpand) 

( 

15 bssStart = (byte*)0; 

> 



2 0 locNLm [01 = 0; 

tocNLfn [1) = 0; 

locNuin [2] = scnHdrs [auxHdrPtr -> ahSnOata - 1], shSize; 

syrnCnt = IdHdrPtr -> IhNSyms; 
2 5 relCnt = IdHdrPtr -> IhHReloc; 



30 



40 



return 0; 
> /* FindStuff () */ 



#if 0 

long FindExportSectionNLmber ( long export NLmber ) 
i 

long seen; 

3 5 //--return exportSymboLs [exportNumber] . sect i onN arise r; 



for (seen = kReExport Import; seen < kMaxLoadableSect i ons; secn++) { 
if (exportNLmber >= loaderHdr [seen], expFirst && 

exportNunber < loaderHdr [seen]. expFirst + loaderHdr [seen]. nLfT£xports) 
return seen; 



return -1; 
> /* F indExportSect i onNLfTtier ( ) */ 
4 5 #endif 

#if 0 

tnt FindSymbot ( byte *name, long *secn, long *offset ) 
50 <: 

i nt i ; 

int length; 

*secn = -1; 
55 *offset = -1; 

if ( ! name) return 0; 

length = strlen ( (char*) name ); 

^0 for (i = 0; i < IdHdr. numExportSyms; i*-*-) i 

if ( (export Symbols [i]. hashword >> 16) != length) continue; 

if ( comparebytes ( exportSymbols Ci]. nameOffset + stringTable, name, length ) ) < 
•seen = F indExportSect ionNLmber ( i ); 
*offset = exportSymbols CiJ. address; 
d5 return i; 
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> 

printf ("# %s: Symbol %s not found\n", gToolName, name); 

5 

return -1; 
> /* FmdSymbol () */ 
#endif 

10 

long FindXCOFFIndexByName ( byte *name, TLdSymPtr IdSyms, Long symCnt ) 
C 

TLdSymPtr sym; 
long symlndex; 
15 int nameLength; 

int xnamLen; 
byte *xnam; 



20 



30 



35 



40 



45 



nameLength = strlen ( (char *) name ); 



for (symlndex = 0; symlndex < symCnt; symindex++) C 
sym = & IdSyms [symlndex]; 

if ( ! IsVat idlmport ( sym ) && ! IsValidExport ( sym ) ) continue; 
GetName ( sym, & xnam, & xnamLen ); 
2 5 if ( xnamLen == nameLength && 

comparebytes ( xnam, name, nameLength ) ) return symlndex; 

> 



return -1; 
> /* FindXCOFFIndexByName () */ 



int CheckLibOpt ( byte **namep, int *nameLengthp, TVersion *linkedVer, TVersion *oldImpVer, 
TBoolean *iB ) 



int i ; 

byte *UbStr; 
int nl; 



*linkedVer - kDef Version; 
*oldImpVer = kDefVersion; 
*i8 = 0; 

for <i =0; i < libOpts; i++) C 
I ibStr s Ubinfo Ci] ; 

nl = mchlen ( libStr, (byte *) "=#-!-" ); 

if ( nl == *nameLengthp && comparebytes ( libStr, *namep, nl ) ) < 
for (;;) ( 

libStr Cnl] ) C 
: break; 

: libStr += nl + 1; 

nl = mchlen < libStr, (byte *) »=#-!-» ); 
*namep = I ibStr; 



5 0 switch ( libStr Cnl] ) C 

case '\0' : break' 



case 



55 *nameLengthp = nl; 

continue; 

case '#' : *oldImpVer = *linkedVer = Getint ( libStr + nl + 1, & libStr, 0 ); 
nl = 0; continue; 

case : *oldImpVer = Getint ( libStr + nl + 1, & libStr, 0 ); nl = 0; 

60 continue; 

// case : nl++; continue; 

conti nue; 
continue; 



65 break; 



case : nl++; continue; 

case '!' : nl++; *iB j= kInitBefore; 

case : nl^+; ♦iS 1= kWeak Import; 
> 
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> 

break; 



> 



> 

return 0; 
> /* CheckLibOpt () */ 



int ProcessExportLt St ( void ) 
10 C 

int err; 

byte *buf, *ptr, *end; 
long len; 
I ong i ndex ; 

err = OpenRead ( expFileName, & buf, & ten ); 
if (err) return err; 



end = buf + I en; 

for ( ptr = buf; ptr < end; ) C 

if <*Dtr++ '\n') addExpCnt++- 

> 

2 5 addExpAry = (AddExpType *) Malloc ( addExpCnt * sizeof (AddExpType) ); 

if C! addExpAry) < 

printf ("# Xs: Could not allocate Xd bytes\n'\ gToolName, addExpCnt • sizeof 
(AddExpType)); 

return -1; 

30 > 

index = 0; 

for ( ptr = buf; ptr < end; ) C 

while (ptr < end && (*ptr == ' ' ;| *ptr == '\t')) ptr+-^; 
-ib if ( *pTr == '3' ) < 

ptr-t-^; 

if ( ptr >= end ) goto ExpErr; 

// format aoldnarne newname 
addExpAry [index], format = kEqui valence; 
^0 addExpAry [index]. oLdname = ptr; 

whi le (ptr < end && 

(*ptr ♦ptr != '\t' && *ptr != '\r' && *ptr >= '\n')) ptr*+- 

if (ptr >= end) goto ExpErr; ' 
*ptr++ = 0; 

45 > 

else C 



addExpAry [index], format = ^Absolute; 
// format sec#:offset class newname 

addExpAry [index], seen = Getint ( ptr, & ptr, end ); 
if (*ptr-tH- !=r | j ptr > end) goto ExpErr; 

addExpAry [index], offset = Getint ( ptr, & ptr, end ); 
while (ptr < end && (*ptr == ' ' |; *ptr ~ '\t')) ptr-^; 
if (ptr >= end) goto ExpErr; 

r- r~ addExpAry [index], class = Getint ( ptr, & ptr, end)* 

b5 > 

while (prr < end && (*ptr == ' ' || *ptr == '\t')) ptr++; 
if (ptr >= end) goto ExpErr; 
addExpAry [index], name = ptr; 

white (ptr < end && *ptr »= '\r' && •ptr != '\n') ptr++; 
if (ptr >= end) goto ExpErr; 
*ptr++ = 0; 
i ndex++ ; 

> 

//" don't unmap, its in use! ReleaseFile ( buf, len ); 
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return 0; 
ExpErr : 

printf %s: enror in export list, line %d\n*\ gToolName, tndex-1); 
5 return -1; 

> /* ProcessExportLi St C ) •/ 

int CheckXCOFF ( byte *theConta i ner ) 
10 { 

TFileHdrPtr fileHdrPtr = <TFi leHdrPtr)theContainer; 

TAuxHdrPtr auxHdrPtr = (TAuxHdrPtrX f i leHdrPtr + 1); 

/* Verify that the container is valid XCOFF. The "file" header must have */ 
15 /* the right magic nurber, the right size for the auxiliary header, and */ 

/* executable and dynamic load flags must be set. The auxiliary header ♦/ 
/* must have the right version number and non-zero section numbers for */ 
/* the .text, .data, .bss, and .loader sections. 
*/ 

20 /* */ 

if (fileHdrPtr->fhHagic != AI XPowerPCHagic !| 
f i leHdrPtr->fhOptHdr != si zeof (TAuxHdr) || 
! IsExec<*f i leHdrPtr) |[ 
25 ! IsDynLoad(*fi leHdrPtr) ) return badUnitErr; 

if (auxHdrPtr->ahVStamp != kXCOFFVersi on |[ 
auxHdrPtr->ahSnText == 0 | | 

3 0 auxHdrPtr->ahSri)ata == 0 | J 

auxHdrPtr->ahSnBSS == 0 | | 

auxHdrPtr->ahSnLoader == 0 ) return badUnitErr; 

35 if <auxHdrPtr->ahVStamp != kXCOFFVersion |[ 

auxHdrPtr->ahSnLoader == 0 ) return badUnitErr; 

return 0; 

4 0 > /* CheckXCOFF () */ 

int OpenRead ( byte *fName, byte **buffer, long *length ) 
C 

4 5 short refNum; 

byte *ptr; 
long I en; 
long err; 
Str255 pName; 



50 



err = MakePStr ( fName, strlen ( (char*) fName ), pWame, sizeof pName ); 
if (err) return err; 



err = HOpenDF ( 0, 0, pName, fsRdPerm, & refNtm ); 
^5 if (err) { printf ("# %s: Could not open input file Xs Xd\n", gToolName, fName, err)- 

return -1; > 

err = GetEOF ( refNum, & len ); 

Cerr) < FSClose ( refNum ); printf ("# %s: Could not seek in %s 5Cd\n", gToolName, 
oU fName, err); return -1; > 

ptr = (byte *) NewPtr ( len ); 

if (• ptr) C FSClose ( refNisn ); printf ("# %s: Could not allocate %d bytesXn", 
gToolName, len); return -1; > 

b 5 
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err = FSRead ( refNum, & ten, ptr ); 

if (err) C FSClose ( refNum ); printf %s: Could not read %s %d\n'\ gToolName, fWame, 
err); return -1; > 

5 FSClose ( refNum ); 

*buffer = ptr; 
*length = len; 

10 return 0; 

> /* OpenRead () */ 

void movebytes ( byte *s, byte *t, long length ) 
15 i 

while (--length >= 0) *t-»-+ = *s+-t-; 

> /* movebytes C) */ 

2 0 void clearbytes C byte *t, long length ) 

<: 

while (--length >= 0) *t++ = 0; 

> /* clearbytes () */ 

25 

int comoarebytes ( byte *s, byte *t, int length ) 
i 

while (--length >= 0) if (*s++ != *t++) return 0; 
return 1 ; 

3 0 > /* coinparebytes C) */ 

byte mem C13 ; 

byte *HaUoc ( long len ) 
35 i 

extern byte *malloc ( long Len ); 

if (! len) return mem; 
return (byte*) (malloc ( len )); 
40 ) /* Malloc () */ 

void Free ( byte *tofree ) 
i 

45 ^pragma unusedCtof ree) 

extern void free ( byte * ); 

/* ha! don't free anything, 

since we are almost never doing so anyway, 
50 and we are mixing NewPtr with Malloc (yuk) 

if ( tofree != mem ) 
free (tofree); 

*/ 

> /* Free () */ 

long Write ( int fd, byte *buf, long len ) 
{ 

int err; 

if { ! len) return 0; 
60 err = FSWrite ( fd, & len, buf ); 

if (err) return err; 
return len; 

> /* Write () •/ 

65 
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long Getint ( byte *ptr, byte **out, byte *cnd ) 
C 

Boolean neg = 0; 
long value = 0; 
5 int mp = 10; 

int off; 

if (! end) end = ptr + 16; 

while (ptr < end && (*ptr ' ' \ \ *ptr == '\t')) ptr^-^j 
10 if (*ptr -= '-') < neg - 1; ptr++; ) 

if (*ptr " '0') ( 
ptr++; 

if C*ptr == 'X') i ptr++; mp = 16; > 
else < mp = 8; > 

15 > 

while (per < end && *ptr) C 

if (*ptr >= '0' && •ptr <= '9') off = '0'; 

else if (*ptr >= 'a' && *ptr <= 'f') off = 'a' - 10; 
20 else if (*ptr >= 'A' && *ptr <= 'F') off = 'A' • 10; 

else break; 
value *- mp; 
value += *ptr-»'+ - off; 

> 

2 5 if (out) 

*out = ptr; 
return neg ? -value : value; 

> /* Getint () V 

30 

int mchlen ( byte *str, byte *mch ) 
i 

int I en = 0; 
byte *mptr; 

3 5 byte ch; 

for (;;) C 

ch = *str++; 
if (! ch) return I en; 
40 mptr = mch; 

while (*mptr) if (*mptr-^+ == ch) return len; 
I erH-+ ; 

> 

> /* mchlen () */ 



45 



50 



55 



60 



int Log2 ( unsigned long a In ) 
i 

int i ; 
a In--; 

for (i =0; aln; i++) aln »= 1; 
return i ; 
> /* Log2 () */ 



OSErr MakePStr ( byte *src, int len, byte *trg, int max ) 
C 

if (len >= max) return ioErr; 

movebytes ( src, trg+1, len ); 
*trg = (byte) len; 



return noErr; 
o5 > /* MakePStr () */ 
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/* UNIX -> MAC date convert. Runs on UNIX only */ 

#include <time.h> 

long HacTimedong unixTime) 

static time_t convert = 0; 
if ( unixTime == 0 ) return 0; 



i f ( ! convert) 

15 time_t gmt = time(O); 

struct tm local = *localtime(&gmt); 

convert = mkt imeC&local) - mktime(gmtime(&gmt)); 
1 f ( local . tm^isdst ) 
2 0 convert += 60 * 60; 

convert += 0x7c25b08Q; /* magic number for 1/1/70 in mac time ^/ 

2 5 return unixTime + convert ; 

> 



3 0 void convertString2Long (byte* theString, unsigned long* theLong) 

< 

♦theLong = (( long)( theStringCOl << 24) + ( long) ( theStr ing [1] « 16) + ( long) ( theStr i ng [2] 
« 8) + ( long)theStringC3) ); 
> 

35 

#if MAC 

long GetFileTime ( byte *fileName ) 
C 

40 CInfoPBRec pb; 

OSErr err; 
Str255 pName; 

err = MakePStr ( fileName, strlen ( (char*) fileName ), pName, sizeof pName ); 

4 5 if (err) return 0; 

clearbytes ( (byte *) & pb, sizeof pb ); 

pb. hFilelnfo. ioNamePtr = pName; 

err = PBGetCatlnfo { & pb, FALSE ); 
if ( err ) return 0; 



return pb. hFilelnfo. ioFlMdDat; 
55 > /* GetFileTime () */ 

#endif 



60 



int CountCompareBytes ( register byte *s, register byte *t, register int length ) 
65 int fullLength = length; 
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while (--length >= 0) if (*s++ !* *t++) return f uU length- 1 ength- 1 ; 
return fullLength; 
> /* CountCompareBytes () */ 



long CountZeroSytes ( byte *src, long length ) 
( 

long cnt = 0; 

while ( --length >= 0 ) C 
10 if ( *src++ 0 ) cnt++; 

> 

return cnt; 
> /* CountZeroSytes () */ 



15 



45 



#define kMaxForwardMatch 12 



typedef struct i 
long opcode; 
20 long matchLen; 

long diffCnt; 
long rptCnt; 
long backup; 

2 5 byte *ptr; 

byte *end; 

long coverage; 
long cost; 

3 0 long overhead; 

long gain; 
> Packlnstr, *PackInstrPtr; 

3 5 byte *gBlockStart; 

/* 

RBLK rptCnt, matchCnt, diffCnt 

RBLK rptCnt=1, matchCnt=0, diffCnt=x ==> BLK x 

4 0 RBLK rptCnt=x, matchCnt=y, diffCnt=0 ==> RPT y,x 

*/ 



byte *targetBuf, *targetEnd; 



OSErr EmitBytes < byte *pointer, long count ) 
< 

register long cnt = count; 
50 register byte *ptr = pointer; 

int Ip = 0; 

if ( cnt <= 0 ) return noErr; 

55 if ( targetBuf + cnt >= targetEnd ) return -1; 

while ( --cnt >= 0 ) < 
*targetBuf*+ = ^ptr*-*-; 

> 

60 

#if DEBUG 

ptr = pointer; 
cnt = count; 
printf (" "); 
o5 whi le ( --cnt >- 0 ) i 
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10 



printf C" %02x", *ptr++); 
if ( +>lp == 15 ) C 

printf ("\n"); 

printf (" , "); 

Lp = 0; 

) 

) 

printf {"\n"); 

if ( debug ) f flush ( stdout ); 
#endif 



15 



return noErr; 
) /* EmitSytes () */ 



Packlnstr buflnst; 
Packlnstr newlnst; 



20 
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OSErr Emi tCount ( long count ) 

i 

if ( count < 0 ) return noErr; 

if ( targetBuf >= targetEnd-5 ) return -1; 

if ( count <= 127 ) < 
*targetBuf++ = count; 

> 

else if < count <= (127«7) + 127 ) < 
*targetSuf-»-* = (count >> 7) \ 0x80; 
♦targetBuf ++ = count & Ox7f; 

) 

else if ( count <= ( ( 127<<7)+1 27«7) + 127 ) < 
*targetBuf++ = (count >> 14 ) [ 0x80; 
*targetBuf++ = (count >> 7) | 0x80; 
*targetBuf = count & 0x7f; 

) 

else if ( count <= ( ( ( 127«7)+127<<7) + 127<<7) + 127 ) C 
*targetBuf++ = (count » 21 ) 1 0x80; 
*targetBuf++ ~ (count >> K ) [ 0x80; 
*targetBuf++ = (count >> 7) | 0x80; 
* targetBuf = count & 0x7f; 

> 

else < 

*targetBuf++ = (count >> 28 ) | 0x80; 
*target8uf++ = (count >> 21 ) | 0x80; 
*targetBuf-^+ = (count >> 14 > | 0x80; 
*targetBuf++ = (count >> 7) } 0x80; 
*targetBuf++ = count & 0x7f; 



55 



60 



65 



return noErr; 
> /* EmitCount () */ 



OSErr EmitOpcode ( byte op, long count ) 

if ( count <= 0 ) return noErr; 

if ( targetBuf >= targetEnd ) return -1; 

if ( count <= kF i rstOperanc^^ask ) < 

* targetBuf ++ = (op << kOpcodeSh i f t ) count; 
return noErr; 

> 
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•targetBuf** = (op << kOpco'deSh i f t ) ; 
return EfnitCount ( count ); 
> /* EmitOpcode () */, 
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OSErr EmitOnelnst ( byte ch, PacklnstrPtr ibp ) 
( 

^pragma unused(ch) 
long cnt; 
byte *ptr; 
OSErr err; 

if ( ibp -> opcode == kNoOpcode ) return noErr; 

#if DEBUG 

printf ("%c%04lx-mix», 
ch. 



#endif 



ibp -> ptr - gBlockStart, 

ibp -> end - gB locks tart - 1); 



switch ( ibp -> opcode ) C 
case kZero : 



// 

#if DEBUG 



#endif 

case kBlock 



// 

#if DEBUG 



#endi f 



case kRepeat: 



// 

#if DEBUG 



#end if 



err = EmitOpcode ( kZero, ibp -> rptCnt ); 
if ( err ) return err; 

printf C" ZRO #%d\n", ibp -> rptCnt ); 

printf (" ZRO #%d", ibp -> rptCnt ); 

pri ntf < "\ tv=%d, c=%d, o=%d, g=%d\n*' , 

ibp -> coverage, 

ibp -> cost, 

ibp -> overhead, 

ibp -> gain ); 

break; 

err = EmitOpcode ( kBlock, ibp -> diffCnt ); 
if ( err ) return err; 

printf (" BIK #%d\n'*, ibp -> diffCnt ); 

printf (" BLK ibp -> diffCnt ); 

printf ( " \ t v=%d , c =%d , o=%d , g='^d\n" , 

ibp -> coverage, 

ibp -> cost, 

ibp -> overhead, 

ibp -> gain ); 

err = Emi tOytes ( ibp -> ptr, ibp -> diffCnt ); 

if ( err ) return err; 

break; 

err = EmitOpcode ( kRepeat, ibp -> diffCnt ); 
if ( err ) return err; 

printf (" RPT #%d*%d\n", ibp -> matchLen, ibp -> diffCnt )j 

printf (" RPT #%d*%d", ibp -> matchLen, ibp -> diffCnt )• 
printf ("\tv=%d,c=%d,o=%d,g=%d\n", 

ibp -> coverage, 

ibp -> cost, 

ibp -> overhead, 

ibp -> gain ); 

err = EmitBytes ( ibp -> ptr, ibp -> matchLen ); 
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if ( err ) return err; 
break; 
case kRepeatBlock : 

if ( ibp -> diffCnt == 0 ) ( 

err = £mi cOpcode ( kRepeat, ibp -> matchLen ); 

if C err ) return err; 

> 

else i 

err = EmitOpcode ( kRepeatBlock, ibp -> matchLen ); 

i f ( err ) return err; 

err = EmitCount ( ibp -> diffCnt ); 

if { err ) return err; 

> 

err = EmitCount ( ibp -> rptCnt - 1 ); 
i f ( err ) return err; 

printf (" RBLIC #C%d,Xd)*%d\n" , ibp -> matchLen, ibp -> diffCnt, ibp -> 



// 

rptCnt ); 
#if DEBUG 

rptCnt ); 



printf (" RBLK #(%d,%d)*%d", ibp -> matchLen, ibp -> diffCnt, ibp -> 

pr i nt f ( "\ tv='/^ , c=%d, o=%d , g=%d\n" , 

ibp -> coverage, 

ibp -> cost, 

i bp - > overhead, 

ibp -> gain ); 



#endif 



err = EmitBytes ( ibp -> ptr, ibp -> matchLen ); 

if ( err > return err; 

cnt = ibp -> rptCnt - 1; 

ptr = ibp -> ptr + ibp -> matchLen; 

while ( --cnt >= 0 ) < 

err = EmitBytes C ptr, ibp -> diffCnt ); 

if ( err ) return err; 

ptr ibp -> matchLen + ibp -> diffCnt; 

> 

break; 
case kRepeatZero : 

err = EmitOpcode ( kRepeatZero, ibp -> matchLen ); 

if ( err ) return err; 

err = EmitCount ( ibp -> diffCnt ); 

if ( err ) return err; 

err = EmitCount ( ibp -> rptCnt - 1 ); 
if ( err ) return err; 

printf (" RZRO #(Xd,%d)*%d\n", ibp -> matchLen, ibp -> diffCnt, ibp 



// 

rptCnt ); 
#if DEBUG 

rptCnt ); 



#endi f 



default 
#if DEBUG 



printf (" RZRO #(%d,%d)*%d" , ibp -> matchLen, ibp -> diffCnt, ibp -> 

pr i nt f < •'\tv=%d, c=%d, o=%d, g=%d\n" , 

ibp -> coverage, 

ibp -> cost, 

ibp -> overhead, 

ibp -> gain ); 

cnt = ibp -> rptCnt • 1; 

ptr = ibp -> ptr + ibp -> matchLen; 

whi le ( --cnt >= 0 ) C 

err = EmitBytes ( ptr, ibp -> diffCnt ); 

if ( err ) return err; 

ptr += ibp -> matchLen * ibp -> diffCnt; 

> 

break; 



printf ("??? %d\n", ibp -> opcode); 
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#endTf 
) 



break; 



if ( ch ==.' = ' ) 

gPackedSize += ibp -> overhead + ibp -> cost; 



ibp -> opcode = kNoOpcode; 
#if DEBUG 

if ( debug ) f flush ( stdout ); 
#endi f 

return noErr; 
) /* EmitOnelnst () */ 

20 OSErr Emitlnst ( PacklnstrPtr pi ) 
OSErr err; 

if ( pi -> opcode == kNoOpcode ) return noErr; 

2 5 err - EmitOnelnst ( ' = pi ); 

pi -> opcode = kMoOpcode; 
return err; 
> /* Emitlnst () */ 

30 

mt Matches < register byte *ptr, register byte *end, PacklnstrPtr bestlnstr ) 

register long distance; 
register long matchLen; 

3 5 register long rptCnt; 

long bestMatchLen; 
long repeatedZeros; 
long zeroBytes; 
byte *match; 
40 byte found; 

byte tryShorter; 
byte ch; 

Packlnstr thislnstr; 
long backup; 

4 5 long backupCnt; 

long backupRptCnt; 
long fwd; 
long fwdCnt; 
long fwd3Cnt; 
50 long fwdRptCnt; 

long fwd3RptCnt; 
long thisFwd; 
long matchCount; 

55 clearbytes ( (byte *> bestlnstr, sizeof *bestlnstr )- 

found = FALSE; 

bestMatchLen = 0; 

60 ch = *ptr; 

for ( distance = 1; distance <= kMaxForwardMatch; distance*-* ) < 
if ( ptr + distance >= end ) break; 
if ( ch != ptr [distance] ) continue; 
// found a potential match 

^or ( matchLen = 1; matchLen < distance; matchLen-*-* ) C 
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if ( ptr + distance + matchLen >= end ) break; 

if ( ptr CmatchLen) != ptr [distance ♦ matchLenJ ) break; 

> 

5 if ( matchLen <"= bestMatchLen ) continue; 

bestMatchLen = matchLen; 

Shorter: 

10 tryShorter = FALSE; 

repeatedZeros = CountZeroBytes ( ptr, matchLen ); 
// if ( repeatedZeros == matchLen ) then we know it is 
// a repeated zero 

15 // type of match 

thislnstr. diffCnt = distance - matchLen; 

if ( thislnstr. diffCnt == 0 && matchLen > 1 ) ( 
2 0 // remember, diff count is zero 

// so, this is a repeat only 
if ( repeatedZeros == matchLen 1| 
matchLen >= 5 && 

5 * repeatedZeros >= 3 ' matchLen ) 

2 5 continue; 

> 

thislnstr. backup = 0; 
backup = 0; 

3 0 backupCnt = 0; 

backupRptCnt = 0; 
fud = 0; 
fwdCnt = 0; 
fwdRptCnt = 0; 

3 5 fwcDCnt = 0; 

fwd3RptCnt = 0; 

match = ptr; 
zeroBytes = 0; 

4 0 for ( rptCnt = 1; ; rptCnt++ ) C 

match -»■= distance; 

if ( match + matchLen >= end ) break; 
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matchCount = CountCompareBytes C ptr, match, matchLen ); 

if ( matchCount < matchLen ) ( 

if ( matchCount > tryShorter = TRUE; 
break; 

> 



// match of at least one Cie: rpt of 2) 
// now, see if it is practical to go backward 
// to extend the match 
// or should this be an on going thing 

5 5 if ( backup == 0 ) C 

backupRptCnt = rptCnt; 
while ( backup < thislnstr. diffCnt && 
buflnst. opcode == kZero && 
backup < buflnst. rptCnt && 
60 ptr [-backup- 1] == match [-backup-1] ) C 

backup*-*; 

> 

> 

// count backup byte compares 

6 5 else if ( comparebytes ( ptr -backup, match -backup, backup ) ) C 
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backupCnt++; 

> 

else { 

// we couldn't continue the backup. 

// is it worth stoppin9 the match just to allow the backup? 
// or, deleting the back up? 
i f ( backupCnt > 3 ) ( 
.// it was worth it : 
// so, either use the backup 
// or stop this repeat short of it 
if ( backupRptCnt •= 1 ) 

rptCnt = backupRptCnt - 1; /* ! to try to catch the backup next time! */ 
break; 

> 

15 backup = 0; 

backupCnt = 0; 
backupRptCnt = 0; 

> 



10 



20 if ( thislnstr. diffCnt ) C 

// count forward compares 

thisFwd = CountCompareBytes ( match - thislnstr. diffCnt, match + matchLen 
thislnstr. diffCnt ); 
2 5 if ( fwd == 0 ) < 

if C thisFwd ) C 
fwd = thisFwd; 
fwdCnt = 1; 
fwdRptCnt = rptCnt-1; 

30 > 
> 

else if ( thisFwd ) C 

// some match larger than the current instruction 
f wdCnt++; 

35 > 

else i 

// extra match count no longer present 
if ( fwdCnt >= matchLen 3 ) C 
// this match cnt is noteworthy: 
// record repeat count of this match 
// to stop at. so that a resart can 
// catch this new longer match 
if ( fwdSCnt == 0 ) fwdSRptCnt = fwdRptCnt; 
if ( f+fwd3Cnt >= 2 ) C 
fwd = 0; 
rptCnt = fwd3RptCnt; 
break; 

> 

5 0 fwd = 0; 

fwdCnt = 0; 
fwdRptCnt = 0; 

> 

zeroBytes ♦= CountZeroBytes C match - thislnstr. diffCnt /* matchLen */ 
thislnstr. diffCnt ); 

if ( repeatedZeros " matchLen ) ( 

if < zeroBytes == thislnstr. diffCnt |[ 

3 * zeroBytes >= thislnstr. diffCnt * rptCnt ) break; 

else ( 
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#if trim 
tfendif 



if ( 5 * zeroBytes >= 3 * thislnstr. diffCnt * rptCnt ) break; 

> 
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) 



) 



if ( fgd && fwqtCnt > 3 ) < 
5 // stopped in the middle of a match 

// that had an extended match 
rptCnt = fwdRptCnt; 

> 

10 if ( rptCnt < 2 ) continue; 

if ( rptCnl == 2 && matchLen == 1 && thislnstr. diffCnt == 1 ) continue; 



// trim last repeat off of a RZRO if its dtffpart is all zeros 
// ?? if ( repeatedZeros == > 

// match a dist, mien, for rptCnt 



// did we stop with a good backup? 
if ( backup && backupRptCnt == 1 ) 
2 0 thislnstr. backup = backup; 



// determine 1) coverage 

// 2) cost 

// 3) overhead 



thislnstr. coverage = distance * (rptCnt - 1) + matchLen; 
thislnstr. ptr = ptr; 

thislnstr. end = ptr + thislnstr. coverage; 

30 if ( thislnstr. backup ) ( 

matchLen thislnstr. backup; 
thislnstr. diffCnt -= thislnstr. backup; 
thislnstr. ptr -= thislnstr. backup; 
thislnstr. coverage thislnstr. backup; 
3 5 repeatedZeros CountZeroBytes ( thislnstr. ptr, thislnstr. backup ); 

> 

if C repeatedZeros == matchLen ) { 
if ( thislnstr. diffCnt 0 ) < 
40 thislnstr. opcode = kZero; 

thislnstr. cost = 0; 

> 

else < 

thislnstr. opcode = kRepeatZero; 
45 thislnstr. cost = (rptCnt - 1) * thislnstr. diffCnt; 

> 

> 

else C 

thislnstr. cost = matchLen + (rptCnt - 1) * thislnstr. diffCnt; 

50 thislnstr. opcode = kRepeatBlock; 

> 

thislnstr. overhead = (matchLen != 0) + (thislnstr. diffCnt != 0) + 1; 

if ( thislnstr. cost == 0 ) 
55 thislnstr. gain = thislnstr. coverage * ( 78 - thislnstr. overhead ); 

else 

thislnstr. gain = ( thislnstr. coverage * 100 ) / ( thislnstr. cost + 

thislnstr- overhead ); 

60 // thislnstr. gain -= thislnstr. overhead; 

//thislnstr. gain = thislnstr. coverage - thislnstr. cost - thislnstr. overhead; 



65 



// done later : 

// if ( thislnstr. gain < 122 ) continue; 
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// these next 5 are really part of the if 
// but hear because they are useful for 
// the debugging in the else part 
this Inst r. matchLen = matchLen; 
thislnstr. rptCnt = rptCnt; 

if ( thislnstr. gain >= 104 i& 

( ! found II thislnstr. gain > bestlnstr -> gain ) ) C 
found = TRUE; 



♦bestlnstr - thislnstr; 

if < thislnstr. diffCnt == 0 && 

thislnstr. coverage >= kMaxForwardMatch ) break- 

*^if DEBUG 
15 if ( debug ) 

EmitOnelnst ( ' + ' , & thislnstr ); 

#endi f 

) 

else < 
2 0 #if DEBUG 

if ( debug ) 

EmitOnelnst ( & thislnstr ); 

Jifendif 

} 



if < tryShorter ) < 

matchLen = matchCount; 
goto Shorter; 

> 



return found; 
> /* Matches () ♦/ 



OSErr PackData ( register byte *ptr, register byte *end, byte *target, byte ♦*tend ) 
OSErr err; 

targetBuf = target; 
targetEnd = *tend; 

gBlockStart = ptr; 

buflnst. opcode - kNoOpcode; 
buflnst. end = 0; 

while ( ptr < end ) C 

if C Matches ( ptr, end, & newlnst ) ) i 



if ( newlnst. backup ) < 

buflnst. rptCnt -= newlnst. backup; 
if ( buflnst. rptCnt == 0 ) 
^5 buflnst. opcode = kNoOpcode; 

> 



if ( ptr >= buflnst. end ) < 
err = Emit Inst ( & buflnst ); 
if ( err ) return err; 

> 

buflnst = newlnst; 

// for now, advance 
ptr = newlnst. end; 
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cont inue; 

> 

else ( 

if ( buflnst. opcode == kBlock ) ( 
buflnst. diffCnt+-^; 
buflnst. end-f+; 
buflnst. cost++; 
buf 1 nst - coverage-t-+; 
// buflnst. gain-^-*-; 

> 

else C 

err = Emit I nst ( & buflnst ); 
if ( err ) return err; 

clearbytes ( (byte *) & buflnst, sireof buflnst ); 
buf I nst . opcode = kS lock; 

= "I; 

= 1; 

= Ptr; 

end = ptr*1; 

coverage = 1 ; 



buflnst. diffCnt 
buflnst. rptCnt 
buflnst. ptr 



buflnst 
buflnst 
buflnst 
buf Inst 



cost = 1 

overhead = 1 



buflnst. gain 

> 

> 

pt r++ ; 

> 

Emitlnst C & buflnst ); 

*tend = targetBuf; 

return noErr; 
> /* PackData () */ 



= 100 - 1; 



pascal void SlockClear ( void *s, long I ) 
40 < 

clearbytes ( (byte *) s, (long) I ); 
) /* BlockClear () */ 



45 long GetCount ( register byte *ptr, byte **nxt ) 

( 

register long value = *ptr++; 
register byte nextB; 

50 if ( value & 0x80 ) < 

nextB = *ptr+'t-; 
value = (value & Ox7f> << 7; 
if ( nextB & 0x80 ) < 

value += nextB & Ox7f; 
nextB = *ptr-t"»-; 
value <<= 7; 
if ( nextB & 0x80 ) < 
value += nextB & 0x7f; 
nextB = *ptr+>; 
6 0 value <<= 7; 

if ( nextB & 0x80 ) < 
value +- nextB & 0x7f; 
nextB = *ptr*'^; 
value <<= 7; 

65 > 
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> 

> 

value += nextB; 

> 

5 *nxt = ptr; 

return value; 
> /* GetCount () */ 
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OSErr UnpackData ( register byte 'ptr, register byte *end, register byte *trg, byte *tend ) 

byte *startPtr = ptr; 
register byte u1; 
register long cnt; 
register long cntX; 
register long rep; 
register long dif; 
register long offset; 
byte *nxt; 

#if DEBUG 

printf ("ptr = %x,m end = %x\n'\ ptr, end ); 



#endif 



whi le ( ptr < end } i 
ul = •ptr**; 

if ( u1 & kFirstOperandMask ) cnt = ul & kF i rstOperandHask; 
else C 

cnt = GetCount C ptr, & nxt ); 
ptr = nxt; 

// we know the count is larger than kFirstOperandMask (about 31) 
switch ( u1 >> kOpcodeShift ) C 
case kZero : 



#if DEBUG 
#endif 



case kBlock : 



U\f DEBUG 
#endi f 



printf ( " ZROx %3d\n", cnt ); 

if ( trg+cnt > tend ) return -1; 
BlockClear ( trg, cnt ); 
trg += cnt; 
continue; 



printf ( " BLKx %3d\n", cnt ); 

if ( trg+cnt > tend ) return -1; 
BLockMove ( ptr, trg, cnt ); 
ptr += cnt; 
trg cnt; 
cont inue; 



55 
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switch ( ul » kOpcodeShift ) i 
default : 



kOpcodeShift ); 



case kZero 
#if DEBUG 

#endif 



printf ( %s: Unknown opcode Xd\n" , gToolName, ul >> 
goto EBOT; 

printf ( " 2R0 %3d\n», cnt ); 

if ( trg+cnt > tend ) goto EBOT; 
while ( --cnt >= 0 ) { *trg++ = 0; > 
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case kBlock 
#if DEBUG 

#end 1 f 



case kRepeat 



#if DEBUG 
#endif 



printf ( " Bl< y3d\n" , cnt ); 

if ( trg+cnc > tend ) goto EBOT; 
while C --cnt 0 ) C *trg++ = *ptr++ 
break; 



rep = GetCount ( ptr, & nxt ) + 1; 
ptr = nxt; 

printf ( '» RPT %3d,%3d\n", cnt, rep ); 

if ( cnt == 1 ) < 

// repeat one byte over and over, 

// comnon enough to test for and optimize 

u1 = •ptr++; 

if C trg+rep > tend ) goto EBOT; 
whi le ( --rep >= 0 > C 
*trg++ = ul; 

> 

break; 

> 

whi le ( --rep >= 0 ) { 
ptr = nxt; 
cntX = cnt; 

if ( trg+cntX > tend ) goto EBOT; 
while ( --cntX >= 0 ) < 
*trg++ = *ptr*+; 

> 

> 

break; 



case kRepeatZero 
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#if DEBUG 
#endif 



LI 



dif = GetCount ( ptr, & nxt ); 
ptr = nxt; 

rep = GetCount ( ptr, & nxt ); 
ptr = nxt; 

printf ( RPTZ ?i3d,?3d, %3d\n" , cnt, dif, rep^-1 ); 

// NOTE: rep counts the # of dif parts 
// which equals the repeated parts - 1 
// so, we don't rep--; 

offset = 0; 

// do repeated part first 
goto Li; 

while ( --rep >= 0 ) ( 
// do diff part 
ptr += offset; 
cntX = dif; 

if ( trg+cntX > tend ) goto EBOT; 
while ( --cntX >= 0 ) < 
*trg*+ = *ptr++; 

> 

offset dif; 

//do repeated part 
ptr = nxt; 
cntX = cnt; 
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if ( trgt-cntX > tend ) goto EBOT; 
whi le ( --cntX >= 0 ) ( 
*trg++ = 0; 

> 

) 

ptr += offsets- 
break ; 
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case kRepeatSlock : dif = GetCount C ptr, & nxt ); 
ptr = nxt; 

rep = GetCount ( ptr, & nxt ); 
ptr = nxt; 



printf C RPTB %3d, 53d,%3d\n'\ cnt, dif, repn-l ); 

// NOTE: rep counts the U of dif parts 
// which equals the repeated parts - 1 
// so, we don't rep--; 

2 0 offset = 0; 

// do repeated part first 
goto L2; 

whi le ( --rep >= 0 ) < 
25 //do diff part 

ptr offset; 
cntX = dif; 

if < trg+cntX > tend ) goto EBOT; 
while ( --cntX >= 0 ) < 
*trg++ = *ptr++; 

> 

offset += dif; 



L2 : // do repeated part 

ptr = nxt; 



cntX = cnt; 

if ( trg*cntX > tend ) goto EBOT; 
whi le ( --cntX >= 0 > C 
*trg++ = *ptr++; 

40 > 

) 

ptr offset; 
break; 

45 > 
> 

return noE rr; 
50 EBOT : 

printf ( "# 7.s: Unpack Error a %d\n", gToolName, --ptr - startPtr ); 
return -1; 
> /* UnpackOata () ♦/ 

55 

byte *Part ialName ( byte *nafne ) 
i 

byte *end; 

60 for ( end = name + strlen ( (char *) name ); 

end > name; 
end-- ) { 



65 



if ( end [-13 == ' : ' || end [-1] == ) break; 
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return end; 
> /* Part ialMame () ^/ 
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CLAIMS 

1. A method for loading a relocatable file into a 
computer system memory for execution, said file having 
a plurality of information items to be loaded into said 
memory, certain ones of said information items having a 
respective first address field containing a relocatable 
address, said file further having a plurality of 
relocation instructions, comprising the steps of: 

retrieving one of said relocation instructions; and 
updating the address field of n consecutive ones of 
said information items in response to said one of said 
relocation instructions, n being specified in said one 
of said relocation instructions. 

2. A method according to claim 1, further 
comprising the step of repeating said steps of 
retrieving and updating for different ones of said 
plurality of relocation instructions. 

3. A method according to claim 2, wherein said 
relocation instructions are of different types, said 
step of updating comprising steps which depend upon the 
type of each respective one of said relocation 
instructions . 

4. A method according to claim 2, wherein each of 
said relocation instmctions is located at a respective 
address, wherein the address of said relocation 
instruction is determined from a first pointer (reloc) 
for each repetition of said step of retrieving, further 
comprising, in conjunction with each repetition of said 
step of retrieving, the step of updating said first 
pointer (reloc) to a value dependent upon the length of 
said retrieved relocation instruction. 

5. A method according to claim 4, wherein said step 
of updating said first pointer (reloc) comprises the 
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Step of updating said first pointer (reloc) to point to 
the one of said relocation instructions which 
immediately follows said retrieved relocation 
instruction . 

6. A method according to claim 4, wherein said step 
of updating said first pointer (reloc) comprises the 
step of adding to said first pointer (reloc) the number 
of bytes in said retrieved relocation instruction. 

7. A method according to claim 2, wherein the 
address in said memory of the first of said n 
consecutive information items is determined from a 
second pointer (rAddr) for each repetition of said step 
of updating, further comprising, in conjunction with 
each repetition of said step of updating, the step of 
updating said second pointer (rAddr) to a value 
dependent upon n. 

8. A method according to claim 7, wherein said step 
of updating said second pointer (rAddr) comprises the 
step of updating said second pointer (rAddr) to point to 
the first address in said memory which follows said n 
consecutive information items. 

9. A method according to claim 7, wherein said step 
of updating said second pointer (rAddr) comprises the 
step of adding to said second pointer (rAddr) , n times 
a predetermined number of bytes per information item. 

10. A method according to claim 1, wherein said step 
of updating the first address field comprises the step 
of updating the first address field of each given 
information item in said n consecutive information items 
to a value which depends upon a respective given element 
of a table (imports) . 
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11. A method according to claim 10, wherein said 
step of updating the first address field of each given 
information item comprises the steps of: 

updating the first address field of an i'th one of' 
said n consecutive information items to a value which 
depends upon an i'th element of said table, i being 
determined from a count value (rSyml) ; 

incrementing said count value (rSyml) in conjunction 
with each repetition of said step of updating the first 
address field of an i'th one of said n consecutive 
information items; and 

repeating, for each of said n consecutive information 
items, said step of updating the first address field of 
an i'th one of said n consecutive information items and 
said step of incrementing said count value (rSyml) . 

12. A method according to claim 11, wherein said 
step of updating the first address field of an i'th one 
of said n consecutive information items comprises the 
step of adding the contents of said i'th element of said 
table to said first address field of said i'th one of 
said n consecutive information items. 

13. A method according to claim 1, wherein said 
step of updating the first address field comprises the 
step of updating the first address field of each given 
information item in said n consecutive information items 
to a value which depends upon a third pointer (codeA, 
datcLA) , the value in said third pointer (codeA, dataA) 
remaining constant for said entire step of updating the 
first address field of n consecutive ones of said 
information items. 

14. A method according to claim 13, wherein said 
step of updating the first address fields of each given 
information item comprises the step of adding the 
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contents of said third pointer (codeA, dataA) to said 
first address field of said given information item. 

15. A method according to claim 13, further 
comprising the step of selecting said third pointer 
(codeA, dataA) in dependence upon said relocation 
instruction (e.g. CODE, DATA) . 

16. A method for loading a relocatable file into a 
computer system memory for execution, said file having 
a plurality of information items to be loaded into said 
memory, certain ones of said information items having a 
respective first address field containing a relocatable 
address, said file further having a plurality of 
relocation instructions, said relocation instructions 
being of different types, comprising the steps of: 

retrieving one of said relocation instructions from 
a location dependent upon a first pointer (reloc) ; 

updating said first pointer (reloc) in conjunction 
with each repetition of said step of retrieving; 

performing, if said retrieved relocation instruction 
is in a first class of relocation instructions (e.g. 
DDAT, CODE, DATA, DESC, DSC2 , VTBL, SYMR, SYMB , SECN, 
LSECN) , a relocation operation on at ■ least one of said 
information items, the first of said at least one of 
said information items being designated by a second 
pointer (rAddr) , said relocation operation including 
steps which depend upon the type of said retrieved 
relocation instruction; 

updating said second pointer (rAddr) in conjunction 
with each performance of one of said relocation 
operations on said at least one of said information 
items, and 
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repeating said seeps of retrieving, updating said 
first pointer (reloc) , performing, and updating said 



17. A method according to claim 16, wherein said' 
retrieved relocation instruction is in a second class of 
relocation instructions (e.g. DDAT, CODE, DATA, DESC, 
DSC2, VTBL, SYMR) which is within said first class, and 
wherein said steps of said relocation operation comprise 
the step of updating the first address field of n 
consecutive ones of said information items, the 
beginning of said n consecutive ones of said information 
items being designated by said second pointer (rAddr) , 
n being specified in said retrieved relocation 
instruction . 

18. A method according to claim 17, wherein said 
retrieved relocation instruction is in a third class of 
relocation instructions (e.g. DDAT, CODE, DATA, DESC, 
DSC2, VTBL) which is within said second class, and 
wherein said step of updating the first address field 
comprises the step of updating the first address field 
of each given information item in said n consecutive 
ones of said information items to a value which depends 
upon the prior contents of such first address field and 
upon a third pointer (codeA, dataA) , the value in said 
third pointer (codeA, dataA) remaining constant for said 
entire step of updating the first address field of n 
consecutive information items. 

19. A method according to claim 17, wherein said 
retrieved relocation instruction is in a fourth class of 
instructions (e.g. DDAT) within said third class, and 
wherein said step of updating the first address field 
comprises the step of adding the contents of a third 



second pointer (rAddr) . 
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pointer (dataA) to said first address field of each of 
said n consecutive information items , 

further comprising the step , if said retrieved 
relocation instruction is in said fourth class of 
instructions (e.g. DDAT) , incrementing said second 
pointer (rAddr) by a value specified in said retrieved 
relocation instruction prior to said step of performing 
a relocation operation . 

20. A method according to claim 17, wherein said 
retrieved relocation instruction is in a fifth class of 
relocation instructions (e.g. CODE, DATA) within said 
second class, said fifth class including a sixth class 
of relocation instructions (e.g. CODE) and a seventh 
class of relocation instructions (e.g. DATA), and 
wherein said step of updating the first address field 
comprises the steps of : 

adding the contents of a third pointer (codeA) to 
said first address field of each of said n consecutive 
information items if said retrieved relocation 
instruction is in said sixth class of instructions (e.g. 
CODE) ; and 

adding the contents of a fourth pointer (dataA) to 
said first address field of each of said n consecutive 
information items if said retrieved relocation 
instruction is in said seventh class of instructions 
(e.g. DATA) . 

21. A method according to claim 17, wherein said 
retrieved relocation instruction is in an eighth class 
of instructions (e.g. VTBL) within said second class, 

wherein said step of updating the first address 
field comprises the step of adding the contents of a 
fourth pointer (dataA) to the first word of each of said 
n consecutive information items, each of said n 
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consecutive information items having a length of two 
words , 

and wherein said step of updating said second 
pointer (rAddr) comprises the step of incrementing said 
second pointer by 2n words . 

22. A method according to claim 17, wherein said 
retrieved relocation instruction is in a ninth class of 
relocation instructions {e.g. DESC, DSC2) within said 
second class, 

wherein said step of updating the first address 
field comprises the step of adding the contents of a 
third pointer (codeA) to said first address field of 
each of said n consecutive information items, 

wherein each of said n consecutive information items 
further has a second address field, 

said steps of said relocation operation further 
comprising the step of adding the contents of a fourth 
pointer (dataA) to said second address field of each of 
said n consecutive information items . 

23. A method according to claim 17, wherein said 
retrieved relocation instruction is in a tenth class of 
instructions (e.g. SYMR) , and wherein said step of 
updating the first address field comprises the steps of: 

adding, to the first address field of each i'th one 
of said n consecutive information items, the contents of 
a respective i'th element of n consecutive elements in 
a table (imports) , the beginning of said n consecutive 
elements in said table being determined from a count 
value (rSyml) ; and 

adding n to said count value (rSyml) . 

24. A method according to claim 16, wherein said 
retrieved relocation instruction is in an eleventh class 
of relocation instructions (e.g. SYMB, SECN, LSECN) 
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within said first class, and wherein said steps of said 
relocation operation comprise the step of updating the 
first address field of the information item designated 
by said second pointer (rAddr) . 

25. A method according to claim 24, wherein said 
step of updating the first address field comprises the 
step of adding to said first address field the contents 
of a pointer (imports [IDX9] , regions [IDX9] , 
regions [CNT22] ) specified by said retrieved relocation 
instruction . 

26. A method according to claim 25, wherein said 
retrieved relocation instruction is in a twelfth class 
instructions (e.g. SYMB) within said eleventh class, 
wherein said pointer specified by said retrieved 
relocation instruction is given as the i'th element of 
a table (imports) , i being specified in said retrieved 
relocation instruction, said method further comprising 
the step of maintaining a count value (rSyml) indicating 
the next element of said table (imports) to use, said 
steps of said relocation operation further comprising 
the step of storing i + 1 as said count value (rSyml) 
for subsequent use. 

27. A method according to claim 16, further 
comprising the step of, if said retrieved relocation 
instruction is in a thirteenth class of instructions 
(e,g. GDIS, DTIS, LCDIS, LDTIS) , updating the contents 
of said third pointer (codeA, dataA) to a value which 
depends on said retrieved relocation instruction. 

28. A method according to claim 27, wherein said 
plurality of information items is to be loaded into a 
plurality of regions of said memory, each of said 
regions containing contiguously a respective group of at 
least one of said plurality of information items, each 
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of said regions having a respective designation and a 
respective start address, 

and wherein said step of updating the contents of 
said third pointer (codeA, dataA) comprises the step of 
storing as said third pointer (codeA, datciA) the start 
address of one of said regions whose designation is 
specified in said retrieved relocation instruction. 

29. A method according to claim 16, further 
comprising the step of, if said retrieved relocation 
instruction is in a fourteenth class of instructions 
{e.g. DELTA), adding a value to said second pointer 
(rAddr) , which value is specified in said retrieved 
relocation instruction. 

30. A method according to claim 16, further 
comprising the step of, if said retrieved relocation 
instruction is in a fifteenth class of instructions 
(e.g. LABS) , storing a value specified in said retrieved 
relocation instruction as said second pointer (rAddr) . 

31. A method according to claim 16, further 
comprising the step of, if said retrieved relocation 
instruction is in a sixteenth class of instructions 
(e.g. RPT, LRPT) , the step of updating the contents of 
said first pointer (reloc) to a value which depends upon 
said retrieved relocation instruction. 

32. A method according to claim 31, wherein said 
step of updating the contents of said first pointer 
(reloc) comprises the step of subtracting a value j from 
said contents of said first pointer (reloc) , j being 
specified in said retrieved relocation instruction. 

33. A method according to claim 32, further 
comprising the steps of: 

maintaining a count of the number of repetitions of 
said step of subtracting; and 
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inhibiting said seep of subtracting after m of such 
repetitions , m being specified in said retrieved 
relocation instruction . 

34. A method according to claim 23, further- 
comprising the step of , if said retrieved relocation 
instruction is in an seventeenth class of instructions 
(e.g. LSYM) , updating said count value (rSyml) to a 
value which depends upon said retrieved relocation 
instruction . 

35. A method according to claim 26, further 
comprising the step of, if said retrieved relocation 
instruction is in an seventeenth class of instructions 
(e.g. LSYM) , updating said count value (rSyml) to a 
value which depends upon said retrieved relocation 
instruction . 

36. An information storage medium carrying a 
relocatable file for loading into a computer system 
memory, said file having a plurality of information 
items to be loaded into said memory, certain ones of 
said information items having a respective first address 
field containing a relocatable address, said file 
further having a plurality of relocation instructions, 
including a first instruction which specifies updating 
of the first address field of n consecutive ones of said 
information items, n being specified in said first 
relocation instruction. 

37. A medium according to claim 36, wherein said 
relocation instructions further include a second one of 
said relocation instructions, said second relocation 
instruction specifying updating of an address field of 
m consecutive ones of said information items, m being 
specified in said second relocation instruction. 
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38. A medium according to claim 37, wherein said 
first and second relocation instructions are of 
respective different first and second types, and wherein 
the updating specified by relocation instructions of 
said first type is different from the updating specified 
by relocation instructions of said second type. 

39. A medium according to claim 37, wherein said 
first and second relocation instructions have different 
byte lengths. 

40. A medium according to claim 36, wherein said 
first relocation instruction does not specify which of 
said n consecutive information items are to have an 
address field updated. 

41. A medium according to claim 36, wherein said 
updating of the first address field of said n 
consecutive information items as specified by said first 
relocation instruction includes updating the first 
address field of each given information item in said n 
consecutive information items to a value which depends 
upon a respective given element of a table (imports) . 

42. A medium according to claim 41, wherein said 
updating of the first address field of a given 
information item as specified by said first relocation 
instruction includes adding the contents of said 
respective given element of said table to said first 
address field of said given information item. 

43. A medium according to claim 36, wherein said 
updating the first address field as specified by said 
first relocation instruction includes updating the first 
address field of each given information item in said n 
consecutive information items to a value which depends 
upon a third pointer (codeA, dataA) , the value in said 
third pointer (codeA, dataA) remaining constant for said 
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entire updating of the first address field of n 
consecutive ones of said information items. 

44. A medium according to claim 43, wherein said 
updating of the first address fields of each given 
information item as specified by said first relocation 
instruction includes adding of the contents of said 
third pointer (codeA, dataA) to said first address field 
of said given information item. 

45. A medium according to claim 43, wherein said 
updating the first address field as specified by said 
first relocation instruction further includes a 
specification of which of a predetermined plurality of 
pointers constitutes said third pointer. 

46. An information storage medium carrying a 
relocatable file for loading into a computer system 
memory, said file having a plurality of information 
items to be loaded into said memory, certain ones of 
said information items having a respective first address 
field containing a relocatable address, said file 
further having a plurality of relocation instructions, 
said relocation instructions being of different types, 
a first class of said relocation instructions (e.g. 
DDAT, CODE, DATA, DESC , DSC2 , VTBL , SYMR, SYMB, SECN, 
LSECN) specifying a relocation operation on at least one 
of said information items, said at least one of said 
information items not being designated in said 
relocation instructions in said first class of 
relocation instructions, the type of said relocation 
instruction identifying the relocation operation to be 
performed on said at least one of said information 
items . 

47. A medium according to claim 46, wherein the 
relocation operation identified by each given relocation 
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instruction in a second class of relocation instructions 
(e.g. DDAT, CODE, DATA, DESC, DSC2 , VTBL, SYMR) which is 
within said first class, includes updating the first 
address field of n consecutive ones of said information 
items, said n consecutive ones of said information items 
not being designated in said given relocation 
instruction, n being specified in said given relocation 
instruction . 

48. A medium according to claim 47, wherein the 
updating to be performed if said given relocation 
instruction is in a third class of relocation 
instructions (e.g. DDAT, CODE, DATA, DESC, DSC2 , VTBL) 
which is within said second class, includes updating the 
first address field of each given information item in 
said n consecutive ones of said information items to a 
value which depends upon the prior contents of such 
first address field and upon a third pointer (codeA, 
dataA) , the value in said third pointer (codeA, dataA) 
remaining constant for said entire updating of the first 
address field of n consecutive information items. 

49. A medium according to claim 47, wherein the 
updating to be performed if said given relocation 
instruction is in a fourth class of instructions (e.g. 
DDAT) within said third class, includes adding of the 
contents of a third pointer (dataA) to said first 
address field of each of said n consecutive information 
items , 

said given relocation instruction further specifying 
a value by which a second pointer (rAddr) pointing to 
said information items is to be incremented prior to 
said updating. 

50. A medium according to claim 47, wherein the 
updating to be performed includes adding the contents of 
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a third pointer (codeA) to said first address field of 
each of said n consecutive information items if said 
given relocation instruction is in a sixth class of 
relocation instructions (e.g. CODE) which is within said 
second class, and adding the contents of a fourth 
pointer (dataA) to said first address field of each of 
said n consecutive information items if said given 
relocation instruction is in a seventh class of 
instructions (e.g. DATA) which is within said second 
class . 

51. A mediiiin according to claim 47, wherein each of 
said n consecutive information items includes a first 
word and a second word, wherein said updating to be 
performed if said given relocation instruction is in an 
eighth class of instructions (e.g. VTBL) within said 
second class, includes adding the contents of a fourth 
pointer (dataA) to the first word of each of said n 
consecutive information items, the relocation operation 
identified by relocation instructions in said eighth 
class of instructions not including any updating of the 
second word of each of said n consecutive information 
items . 

52. A medium apcording to claim 47, wherein each of 
said n consecutive information items further includes a 
respective second address field containing a relocatable 
address, wherein the relocation operation identified by 
each given relocation instruction in a ninth class of 
relocation instructions (e.g. DESC, DSC2) within said 
second class, includes adding the contents of a third 
pointer (codeA) to said first address field of each of 
said n consecutive information items and adding the 
contents of a fourth pointer (dataA) to said second 
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address field of each of said n consecutive information 
items . 

53. A medium according to claim 47, wherein the 
updating to be performed if said given relocation 
instruction is in a tenth class of instructions (e.g. 
SYMR) , includes adding, to the first address field of 
each i'th one of said n consecutive information items, 
the contents of a respective i'th element of n 
consecutive elements in a table (imports). 

54. A medium according to claim 46, wherein said 
relocation operation specified by each given relocation 
instruction which is in an eleventh class of relocation 
instructions (e.g. SYMB, SECN, LSECN) within said first 
class, includes updating the first address field of the 
information item designated by a second pointer (rAddr) 
the contents of which are not specified in said 
relocation instruction . 

55. A medium according to claim 54, wherein said 
updating of the first address field includes adding to 
said first address field the contents of a pointer 
(imports [IDX9] , regions [IDX9] , regions [CNT22] ) specified 
by said given relocation instruction. 

56. A medium according to claim 46, wherein said 
plurality of relocation instructions further includes 
relocation instructions in a thirteenth class of 
instructions (e.g. DCIS, DTIS, LCDIS, LDTIS) , each 
relocation instruction in said thirteenth class of 
instructions specifying that a third one of a 
predetermined plurality of pointers (codeA, dataA) is to 
be updated, the type of each relocation instruction in 
said thirteenth class of instructions determining which 
of said predetermined plurality of pointers (codeA, 
dataA) is to be said third pointer. 
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57. A medium according co claim 56, wherein said 
plurality of information items is to be loaded into a 
plurality of regions of said memory, each of said 
regions containing contiguously a respective group of at 
least one of said plurality of information items, each 
of said regions having a respective designation and a 
respective start address, 

and wherein the updating specified by each given 
relocation instruction in said thirteenth class of 
instructions includes storing as said third pointer 
(codeA, dataA) the start address of one of said regions 
whose designation is specified in said given relocation 
instruction. 

58. A medium according to claim 46, wherein said 
relocation instructions further include a fourteenth 
class of instructions (e.g. DELTA), each relocation 
instruction in said fourteenth class of instructions 
specifying a value to be added to a second pointer 
(rAddr) into said information items. 

59. A medium according to claim 46, wherein said 
relocation instructions further include a fifteenth 
class of instructions (e.g. LABS), each relocation 
instruction in said fifteenth class of instructions 
specifying a value to be stored as a second pointer 
(rAddr) into said information items. 

60. A medium according to claim 46, wherein said 
relocation instructions further include a sixteenth 
class of instructions (e.g. RPT, LRPT) , each instruction 
in said sixteenth class of instructions specifying a 
number of times for which an immediately pr eding group 
of m of said relocation instructions are tc je repeated. 
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61* A medium according to claim 60, wherein each 
instruction in said sixteenth class of instructions 
further specifies m, 

62. An information storage medium carrying a file' 
for loading into a computer system memory, said file 
comprising : 

an input portion associating a plurality of import 
symbol names with respective import symbol numbers; and 

an export portion associating a plurality of export 
symbol naimes with respective export symbol location 
values, said export portion being distinct from said 
import portion. 

63. A medium according to claim 62, wherein said 
import portion comprises an import symbol table having 
an import table entry corresponding to each of said 
import symbol names, each of said import table entries 
having a respective location in said table and 
identifying the import symbol name to which it 
corresponds, the import symbol number associated with 
each of said import symbol names being derivable from 
the location of the import table entry corresponding to 
the import symbol name in said import symbol table. 

64. A medium according to claim 63, wherein said 
file further comprises a string table containing said 
import symbol names, and wherein each of said entries in 
said import symbol table includes an offset into said 
string table to identify the import symbol name to which 
the entry corresponds . 

65. A medium according to claim 63, wherein the 
import symbol numbers for all of said import symbol 
names collectively constitute a series of consecutive 
integers . 
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66. A medium according to claim 62, wherein each of 
said export symbol location values is selected from the 
group consisting of (a) a pointer to a location in said 
file and (b) an import symbol number . 

67. A medium according to claim 62, wherein said 
export portion comprises an export symbol table having 
an export table entry corresponding to each of said 
export symbol names, each of said export table entries 
identifying the export symbol name to which it 
corresponds and the export symbol location value 
associated with the export symbol name to which . it 
corresponds . 

68. A medium according to claim 62, wherein said 
file further comprises at least one loadable section, 
and wherein a first one of said export symbol location 
values includes a pointer to a location in a first one 
of said loadable sections. 

69. A medium according to claim 68, wherein said 
first export symbol location value further selects which 
of said at least one loadable section is said first 
loadable section. 

70. A medium according to claim 62, wherein said 
import portion comprises an import symbol table having 
an import table entry corresponding to each of said 
import symbol names, each of said import table entries 
having a respective location in said table and 
identifying the import symbol name to which it 
corresponds , the import symbol number associated with 
each of said import symbol names being derivable from 
the location of the import table entry corresponding to 
the import symbol name in said import symbol table, said 
import symbol table being distinct from said export 
symbol table. 
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71. A medium according to claim 70, wherein said 
file further comprises a plurality of import symbol 
strings located in an import symbol naime string portion 
of said file, and a plurality of export symbol strings 
located in an export symbol name string portion of said 
file, 

wherein each of said import table entries designates 
one of said import symbol name strings to identify the 
import symbol name to which the import table entry 
corresponds, 

wherein each of said export table entries designates 
one of said export symbol name strings to identify the 
export symbol name to which the export table entry 
corresponds , 

and wherein said import and export symbol name 
string portions of said file are distinct, 

72. A medium according to claim 71, wherein said 
import and export symbol name string portions are each 
distinct from said export symbol table, and wherein said 
export symbol name string portion of said file is 
located closer in said file than said import symbol name 
string portion of said file, to said export symbol 
table. 

73. A medium according to claim 72, wherein said 
import and export symbol name string portions are each 
distinct from said import symbol table, and wherein said 
import symbol name string portion of said file is 
located closer in said file than said export symbol name 
string portion of said file, to said import symbol 
table. 

74. A medium according to claim 67, wherein said 
export symbol table is hashed. 
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75. A medium according to claim 67, wherein said 
file further comprises an export slot table which 
associates each of said export symbol names with a 
predetermined subset of said export table entries. 

76. A medium according to claim 67, wherein said 
file comprises a chain entry corresponding to each of 
said export symbol names, each of said chain entries 
containing a hashword which is a first predefined 
function of the export symbol name corresponding to the 
chain entry, said chain entries being grouped into 
slots, at least one of said slots having more than one 
chain entry, each slot corresponding to a respective one 
of a plurality of hash slot values, the hash slot value 
corresponding to each given slot being a second 
predefined function of each of the export symbol names 
corresponding to a chain entry in said given slot. 

77. A medium according to claim 76, wherein said 
file further comprises an export slot table which 
associates each given one of said hash slot values with 
the chain entries in the slot which correspond to said 
given hash slot value, 

78. A medium according to claim 62, wherein said 
file further comprises a library portion associating a 
plurality of import library names with respective sets 
of said import symbol numbers, said library portion 
being distinct from said import portion and from said 
export portion. 

79. A medium according to claim 62, wherein said 
file has a plurality of information items to be loaded 
into said memory, certain ones of said information items 
having a respective first address field containing a 
relocatable address, said file further having a 
plurality of relocation instructions, including a first 
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instruction which specifies updating of the first 
address field of n consecutive ones of said information 
items, n being specified in said first relocation 
instruction . 

80. A medium according to claim 62, said file 
having a plurality of information items to be loaded 
into said memory, certain ones of said information items 
having a respective first address field containing a 
relocatable address, said file further having a 
plurality of relocation instructions, said relocation 
instructions being of different types, a first class of 
said relocation instructions (e.g. DDAT, CODE, DATA, 
DESC, DSC2, VTBL, SYMR, SYMB, SECN, LSECN) specifying a 
relocation operation on at least one of said information 
items, said at least one of said information items not 
being designated in said relocation instructions in said 
first class of relocation instructions, the type of said 
relocation instruction identifying the relocation 
operation to be performed on said at least one of said 
information items. 

81. An information storage medium carrying a file 
for loading into a computer system memory, said file 
having an export symbol table associating a plurality of 
export symbol names with respective export symbol 
location values, said export symbol table being hashed. 

82. An information storage medium carrying a file 
for loading into a computer system memory, said file 
having an export symbol table associating a plurality of 
export symbol names with respective export symbol 
location values, and an export slot table which 
associates each of said export symbol names with a 
predetermined subset of said export table entries. 
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83 . An information storage medium carrying a file 
for loading into a computer system memory, said file 
having an export symbol table associating a plurality of 
export symbol names with respective export symbol 
location values, said file further comprising a chain 
entry corresponding to each of said export symbol names, 
each of said chain entries containing a hashword which 
is a first predefined function of the export symbol name 
corresponding to the chain entry, said chain entries 
being grouped into slots, at least one of said slots 
having more than one chain entry, each slot 
corresponding to a respective one of a plurality of hash 
slot values, the hash slot value corresponding to each 
given slot being a second predefined function of each of 
the export symbol names corresponding to a chain entry 
in said given slot, 

84. A medium according to claim 83, wherein said 
file further comprises an export slot table which 
associates each given one of said hash slot values with 
the chain entries in the slot which correspond to said 
given hash slot value. 

85. A method for loading a relocatable file into a 
computer system memory for execution, said file having 
a plurality of information items to be loaded into said 
memory, a first one of said information items having an 
address field and an indication of a first import symbol 
name for use in updating said address field, comprising 
the steps of: 

loading an import file into said memory, said import 
file having a table of export symbols containing a 
plurality of export symbol slots each having a slot 
value, each of said slots in said table of export 
symbols containing at least one association associating 
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an export symbol name to a respective location value, at 
least one of said slots in said table of export symbols 
containing more than one such association, all of the 
export symbol names associated in each given one of said 
slots having the same hash slot value according to a 
predefined hash- slot -value algorithms- 
determining a first hash slot value for said first 
import symbol name according to said predefined hash- 
slot -value algorithm; 

searching for a first export symbol name in a slot 
corresponding to said first hash slot value in said 
table of export symbols in said import file, which 
export symbol name matches said first import symbol 
name ; and 

updating said address field of said first 
information item in response to the location value 
associated by said table of export symbols to said first 
export symbol name . 

86. A method according to claim 85, wherein each of 
said associations in said table of export symbols in 
said import file further associates a hashword value 
computed from the export symbol name of the association 
according to a predefined hash-word- value algorithm, 
said method further comprising the step of determining 
a first import hashword value for said first import 
.symbol name according to said hash- word- value algorithm, 
and wherein said step of searching comprises the step 
of: 

determining for successive current ones of said 
associations in said slot corresponding to said first 
hash slot value in said table of export symbols, until 
the export symbol name of said current association 
matches said first import symbol name, first whether the 
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export hashword value of said current association 
, matches said first import hashword value, and only if 
so, whether the export symbol name of said current 
association matches said first import symbol name. 

87. A method according to claim 86, wherein said 
import file further has an export slot table associating 
each given slot in said table of export symbols with the 
slot value of said given slot, said method further 
comprising the step of examing said export slot table to 
determine said slot corresponding to said first hash 
slot value. 

88. A method according to claim 85, wherein said 
import file further has an export slot table associating 
each given slot in said table of export symbols with the 
slot value of said given slot, said method further 
comprising the step of examing said export slot table to 
determine said slot corresponding to said first hash 
slot value. 

89. A method for constructing a relocatable file 
for loading into a computer system memory, said file 
having a plurality of information items to be loaded 
into said memory, certain ones of said information items 
having a respective first address field containing a 
relocatable address, comprising the steps of: 

identifying n consecutive ones of said certain 
information items; and 

writing to said file a first relocation instruction 
which specifies updating of the first address field of 
said n consecutive information items, n being specified 
in said first relocation instruction. 

90. A method according to claim 89, further 
comprising the steps of: 
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identifying m consecutive ones of said certain 
information items; and 

writing to said file a second relocation instruction 
which specifies updating of an address field of said m 
consecutive information items, m being specified in said 
second relocation instruction. 

91. A method according to claim 90, wherein said 
first and second relocation instructions are of 
respective different first and second types, and wherein 
the updating specified by relocation instructions of 
said first type is different from the updating specified 
by relocation instructions of said second type, 

92. A method according to claim 90, wherein said 
first and second relocation instructions have different 
byte lengths. 

93. A method according to claim 89, wherein said 
first relocation instruction does not specify which of 
said n consecutive information items are to have an 
address field updated. 

94. A method according to claim 89, wherein said 
updating of the first address field of said n 
consecutive information items as specified by said first 
relocation instruction includes updating the first 
address field of each given information item in said n 
consecutive information items to a value which depends 
upon a respective given element of a table (imports) , 

95. A method according to claim 94, wherein said 
updating of the first address field of a given 
information item as specified by said first relocation 
instruction includes adding the contents of said 
respective given element of said table to said first 
address field of said given information item. 
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96. A method according to claim 89, wherein said 
updating the first address field as specified by said 
first relocation instruction includes updating the first 
address field of each given information item in said n 
consecutive information items to a value which depends 
upon a third pointer (codeA, dataA) , the value in said 
third pointer (codeA, dataA) remaining constant for said 
entire updating of the first address field of n 
consecutive ones of said information items. 

97. A method according to claim 96, wherein said 
updating of the first address fields of each given 
information item as specified by said first relocation 
instruction includes adding of the contents of said 
third pointer (codeA, dataA) to said first address field 
of said given information item. 

98. A method according to claim 96, wherein said 
updating the first address field as specified by said 
first relocation instruction further includes a 
specification of which of a predetermined plurality of 
pointers constitutes said third pointer. 

99. A method for constructing a relocatable file 
for loading into a computer system memory, said file 
having a plurality of information items to be loaded 
into said memory, certain ones of said information items 
having a respective first address field containing a 
relocatable address, comprising the steps of: 

identifying a first plurality of said certain 
information items; and 

writing to said file a plurality of relocation 
instructions, said relocation instructions being of 
different types, a first class of said relocation 
instructions (e.g. DDAT, CODE, DATA, DESC, DSC2 , VTBL, 
SYMR, SYMB, SECN, LSECN) specifying a relocation 
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operation on at least one of said information items in 
said first plurality, said at least one of said 
information items in said first plurality not being 
designated in said relocation instructions in said first' 
class of relocation instructions, the type of said 
relocation instruction identifying the relocation 
operation to be performed on said at least one of said 
information items in said first plurality. 

100. A method according to claim 99, wherein the 
relocation operation identified by each given relocation 
instruction in a second class of relocation instructions 
(e.g. DDAT, CODE, DATA, DESC, DSC2 , VTBL, SYMR) which is 
within said first class, includes updating the first 
address field of n consecutive ones of said information 
items in said first plurality, said n consecutive ones 
of said information items not being designated in said 
given relocation instruction, n being specified in said 
given relocation instruction. 

101. A method according to claim 100, wherein the 
updating to be performed if said given relocation 
instruction is in a third class of relocation 
instructions (e.g. DDAT, CODE, DATA, DESC, DSC2 , VTBL) 
which is within said second class, includes updating the 
first address field of each given information item in 
said n consecutive ones of said information items to a 
value which depends upon the prior contents of such 
first address field and upon a third pointer (codeA, 
dataA) , the value in said third pointer (codeA, dataA) 
remaining constant for said entire updating of the first 
address field of n consecutive information items. 

102. A method according to claim 100, wherein the 
updating to be performed if said given relocation 
instruction is in a fourth class of instructions (e.g. 
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DDAT) within said third class, includes adding of the 
contents of a third pointer (dataA) to said first 
address field of each of said n consecutive information 
items , 

said given relocation instruction further specifying 
a value by which a second pointer (rAddr) pointing to 
said information items is to be incremented prior to 
said updating. 

103. A method according to claim 100, wherein the 
updating to be performed includes adding the contents of 
a third pointer (codeA) to said first address field of 
each of said n consecutive information items if said 
given relocation instruction is in a sixth class of 
relocation instructions (e.g. CODE) which is within said 
second class, and adding the contents of a fourth 
pointer (dataA) to said first address field of each of 
said n consecutive information items if said given 
relocation instruction is in a seventh class of 
instructions (e.g. DATA) which is within said second 
class . 

104. A method according to claim 100, wherein each 
of said n consecutive information items includes a first 
word and a second word, wherein said updating to be 
performed if said given relocation instruction is in an 
eighth class of instructions (e.g. VTBL) within said 
second class, includes adding the contents of a fourth 
pointer (dataA) to the first word of each of said n 
consecutive information items, the relocation operation 
identified by relocation instructions in said eighth 
class of instruction^ not including any updating of the 
second word of each of said n consecutive information 
items , 
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105. A method according to claim 100, wherein each 
of said n consecutive infoirmation items further includes 
a respective second address field containing a 
relocatable address, wherein the relocation operation 
identified by each given relocation instruction in a 
ninth class of relocation instructions (e.g. DESC, DSC2) 
within said second class, includes adding the contents 
of a third pointer (codeA) to said first address field 
of each of said n consecutive infoirmation items and 
adding the contents of a fourth pointer (dataA) to said 
second address field of each of said n consecutive 
information items . 

106. A method according to claim 100, wherein the 
updating to be performed if said given relocation 
instruction is in a tenth class of instructions (e.g. 
SYMR) , includes adding, to the first address field of 
each i'th one of said n consecutive information items, 
the contents of a respective i'th element of n 
consecutive elements in a table (imports) . 

107. A method according to claim 99, wherein said 
relocation operation specified by each given relocation 
instruction which is in an eleventh class of relocation 
instructions (e.g. SYMB, SECN, LSECN) within said first 
class, includes updating the first address field of the 
information item designated by a second pointer (rAddr) 
the contents of which are not specified in said 
relocation instruction . 

108. A method according to claim 107, wherein said 
updating of the first address field includes adding to 
said first address field the contents of a pointer 
(imports [IDX9] , regions tIDX9] , regions [CNT22 ] ) specified 
by said given relocation instruction. 
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109. A method according to claim 99, wherein said 
plurality of relocation instructions further includes 
relocation instructions in a thirteenth class of 
instructions {e.g. DCIS, DTIS, LCDIS, LDTIS) , each 
relocation instruction in said thirteenth class of 
instructions specifying that a third one of a 
predetermined plurality of pointers (codeA, dataA) is to 
be updated, the type of each relocation instruction in 
said thirteenth class of instructions determining which 
of said predetermined plurality of pointers (codeA, 
dataA) is to be said third pointer. 

110. A method according to claim 109, wherein said 
plurality of information items is to be loaded into a 
plurality of regions of said memory, each of said 
regions containing contiguously a respective group of at 
least one of said plurality of information items, each 
of said regions having a respective designation and a 
respective start address, 

and wherein the updating specified by each given 
relocation instruction in said thirteenth class of 
instructions includes storing as said third pointer 
(codeA, dataA) the start address of one of said regions 
whose designation is specified in said given relocation 
instruction. 

111. A method according to claim 99, wherein said 
relocation instructions further include a fourteenth 
class of instructions (e.g. DELTA) , each relocation 
instruction in said fourteenth class of instructions 
specifying a value to be added to a second pointer 
(rAddr) into said information items. 

112. A method according to claim 99, wherein said 
relocation instructions further include a fifteenth 
class of instructions (e.g. LABS) , each relocation 
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instruction in said fifteenth class of instructions 
specifying a value to be stored as a second pointer 
(rAddr) into said information items. 

113. A method according to claim 99, wherein said 
relocation instructions further include a sixteenth 
class of instructions (e.g, RPT, LRPT) , each instruction 
in said sixteenth class of instructions specifying a 
number of times for which an immediately preceding group 
of m of said relocation instructions are to be repeated. 

114. A method according to claim 113, wherein each 
instruction in said sixteenth class of instructions 
further specifies m. 

115. A method for constructing an object code file 
containing a plurality of exports capable of being 
imported into an importing file to be loaded into a 
computer system memory, each of said exports having an 
export symbol name and a respective export symbol 
location value, comprising the steps of: 

writing said exports to said object code file; and 
writing to said object code file an export symbol 
table associating said export symbol names with said 
export symbol location values, said export symbol table 
being hashed. 

116. A method for constructing an object code file 
containing a plurality of exports capable of being 
imported into an importing file to be loaded into a 
computer system memory, each of said exports having an 
export symbol name and a respective export symbol 
location value, comprising the steps of: 

writing to said object code file an export symbol 
table having a plurality of entries each associating a 
respective one of said export symbol names with its 
respective export symbol location value; and 
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writing to said object code file an export slot 
table which associates each of said export symbol names 
with a predetermined subset of said export table 
entries. 

117. A method for constructing an object code file 
containing a plurality of exports capable of being 
imported into an importing file to be loaded into a 
computer system memory, each of said exports having an 
export symbol name and a respective export symbol 
location value, comprising the steps of: 

writing to said obj ect code file an export symbol 
table associating each of said export symbol names with 
its respective export symbol location value; 

writing to said obj ect code file a chain entry 
corresponding to each of said export symbol names, each 
of said chain entries containing a hashword which is a 
first predefined function of the export symbol name 
corresponding to the chain entry , said chain entries 
being grouped into slots, at least one of said slots 
having more than one chain entry, each slot 
corresponding to a respective one of a plurality of hash 
slot values , the hash slot value corresponding to each 
given slot being a second predefined function of each of 
the export symbol names corresponding to a chain entry 
in said given slot. 

118. A method according to claim 117, further 
comprising the step of writing to said object code file 
an export slot table which associates each given one of 
said hash slot values with the chain entries in the slot 
which correspond to said given hash slot value. 

119. A method for constructing a file to be loaded 
into a computer system memory, comprising the steps of: 
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identifying in said file a data pattern consisting 
of RCounta repetitions of a first subpattern, followed 
by a second subpattern of length Countl bytes, each of 
said repetitions of said first subpattern consisting of' 
said second subpattern followed by DCount2 additional 
bytes of data; and 

writing to said file a data expansion instruction 
comprising Countl, DCount2, RCount3, said first 
subpattern, and each of said additional bytes of data. 

120. An information storage medium carrying a file 
for loading into a computer system memory, said file 
having a plurality of data expansion instructions, 
including a first instruction which specifies the 
creation of a data pattern consisting of RCount3 
repetitions of a first subpattern, followed by a second 
subpattern of length Countl bytes, each of said 
repetitions of said first subpattern consisting of said 
second subpattern followed by DCount2 additional bytes 
of data, said first instruction identifying Countl, 
DCount2, RCount3, said first subpattern, and each of 
said additional bytes of data. 

121. A method for loading a file into a computer 
system memory, said file having a plurality of data 
expansion instructions, comprising the steps of: 

retrieving a first one of said data expansion 
instructions; and 

creating a data pattern consisting of RCount3 
repetitions of a first subpattern, followed by a second 
subpattern of length Countl bytes, each of said 
repetitions of said first subpattern consisting of said 
second subpattern followed by DCount2 additional bytes 
of data, said first instruction identifying Countl, 
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DCount2 , RCount3, said first subpatcern, and each of 
said additional bytes of data. 
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